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i ■ We investigate trends between the recent star formation history and black hole growth in 

galaxy bulges in the Sloan Digital Sky Survey (SDSS). The galaxies lie at 0.01 < z < 0.07 
where the fibre aperture covers only the central 0.6-4.0 kpc diameter of the galaxy. We find 
strong trends between black hole growth, as measured by dust-attenuation-corrected [O III] 
luminosity, and the recent star formation history of the bulges. 56% of the bulges are quies- 
cent with no signs of recent or ongoing star formation and, while almost half of all AGN lie 
within these bulges, they contribute only ~10% to the total black hole growth in the local 
Universe. At the other extreme, the AGN contained within the ^4% of galaxy bulges that are 
undergoing or have recently undergone the strongest starbursts, contribute at least 10-20% of 
the total black hole growth. Much of this growth occurs in AGN with high amounts of dust 
extinction and thus the precise numbers remain uncertain. The remainder of the black hole 
growth (>60%) is contributed by bulges with more moderate recent or ongoing star forma- 
tion. The strongest accreting black holes reside in bulges with a wide range in recent SFH. 
We conclude that our results support the popular hypothesis for black hole growth occurring 
through gas inflow into the central regions of galaxies, followed by a starburst and triggering 
of the AGN. However, while this is a significant pathway for the growth of black holes, it is not 
the dominant one in the present-day Universe. More unspectacular processes are apparently 
responsible for the majority of this growth. 

In order to arrive at these conclusions we have developed a set of new high signal-to-noise 
■ ratio (SNR) optical spectral indicators, designed to allow a detailed study of stellar populations 

which have undergone recent enhanced star formation. Working in the rest-frame wavelength 
range 3750-4 150A, ideally suited to many recent and ongoing spectroscopic surveys at low 
and high redshift, the first two indices are equivalent to the previously well studied 4000A 
break strength and HS equivalent width. The primary advantage of this new method is a greatly 
improved SNR for the latter index, allowing the present study to use spectra with SNR-per- 
pixel as low as 8. The third index measures the excess strength of Ca ll(H&K), which is 
particularly sensitive to the transition of a post-starburst spectrum from A to F stars, and 
allows the degeneracy between time of burst and strength of burst to be broken. 

Key words: galaxies:bulges, active, stellar content; methods:statistical 



1 INTRODUCTION 

Galaxy spectra contain a wealth of information on the present 
and past star formation of galaxies, in the form of stellar contin- 
uum shape, stellar and interstellar absorption, and nebular emission 
* vwild@mpa-garching.mpg.de lines. In particular, the region around the 4000A break provides us 
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with powerful diagnostics of the luminosity-weighted mean stel- 
lar age and the fraction of stars formed in recent (<1 Gyr) bursts, 
from the height of the break and the relative strength of the hy- 
drogen Balmer absorption lines respectively. While considerable 
progress has been made in extracting the global star formation his- 
tory (S FH) of galaxies from the i r present day inte g rated optical 
spect r a jKauffmann et ail |2003bl ; IPanter et alj|2003l ; lOcvirk et all 
2006; ICid Fernandes et al. 2006), the potential for recovering de- 
tailed recent SFHs from spectra in these large datasets has remained 
relatively unexplored. 

Determining the recent (< 1 Gyr) star formation history of 
galaxies has considerable importance for understanding the ef- 
fects of external and internal physical processes on the evolution 
of galaxies, since this timescale corresponds to only a few galaxy 
dynamical times. As a result of such processes, galaxies are ex- 
pected to undergo strong fluctuations in their star formation over 
short timescales, which may in turn be linked to other aspects of 
galaxy evolution, such as the build-up of stellar bulges. 

Sharp transitions in the star formation rate of galaxies leave 
clear imprints on the integrated light of the stellar population, as the 
balance of light contributed by stars of different masses changes. 
Spectroscopic research in this area has focussed in particular on 
so-called E+A (or K+A) galaxies - an early type galaxy spectrum 
and superposed A- type stellar spectrum, resulting in strong Balmer 
absorption lines. An excess population of A stars results when the 
corresponding O and B star population, which dominates the light 
during ongoing star formation, has died out or is not visible. There- 
fore, the relative fraction of A to O stars provides information on 
changes in the star formation rate of galaxies over the last ~1- 
2 Gyr. 

Studied spectroscopically since the early 1980's, these ob- 
jects have been suggested to be caused by either a short burst of 
star f o rmation in the last Gyr dSparke et alj 19801 : iDressler & Gunnl 
1 19831 ; IDressler et ail |2004| ; iNolan et all 120061) , recent truncation 
of star fo rmation caused by strippin g of gas in cluster envi- 
ronments dPoggianti et alj |l999. 2004 ), or simply obscuration of 
younger, hotter stars by dus t jSmail et al.l 1 19991 ; iMiller & Owenl 
l200ll ; IPoggianti & Wull200b1) . All the se scenarios may contribute 
to the E+A galaxy population (see also lHogg et al.l200q) , although 
it is generally believed that the objects with very strong Balmer ab- 
sorption lines are incompatible with pure truncation o f star forma- 
tion a nd are likely to be true "post-starburst" objects (Balo gh et al.l 
120051) . 

The E+A phenomena is also oft en observationall y linked 
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implications for the starburst-AGN connection; it is this question 
which we primarily aim to address in this paper. Recent numerical 
simulations have suggested a scenario in which major mergers 
between galaxies provide the fuel and a disrupted gravitational 
potential conducive to both the build up of the galaxy bulge 
through star formation and t he fuelling of t h e cent ral black hole. 
Di Ma tteo et ail d2005h and iHopkins etall d2006l) suggest that 
triggering by such major mergers can account for the properties of 
the entire QSO population. Direct observational confirmation of 
this scenario remains elusive however, and the causal connection 
between starburst and AGN activity is still unconstrained. Further- 
more, alternative theories for the bui ld up of black hole m ass in a 
less spectacular fashion do exist (e.g. lKing" Pringle 2007). To be 
able to accurately describe the recent SFH of galaxies, in terms of 
the age and strength of recent starbursts, and to distinguish between 



truncation of star formation, starbursts or starbursts followed by 
truncation, will allow such theoretical scenarios to be thoroughly 
tested. 

Our ability to measure recent fluctuations in the star forma- 
tion rate of galaxies has in part been restricted by the high signal- 
to-noise ratio (SNR) required of spectra in order to measure the US 
Balmer line with accuracy. The H<5 line is important as the strongest 
Balmer absorption line not to suffer irreparably from emission line 
infilling in all but the galaxies with the highest star formation rates. 
However, this traditional diagnostic is no longer adequate for our 
purposes, being unsuitable for detecting all but the strongest post- 
starburst galaxies in current high redshift spectroscopic surveys, or 
for detecting small and/or relatively old burs ts without employin g 
the stacking spectra to obtain mean results ( IDressler et aLl l2004). 
The intrinsic dependence of the H<5 line strength on luminosity 
weighted mean stellar age also causes populations defined through 
a simple cut on equivalent width (EW) to be biased towards the 
youngest or the very strongest post-starburst systems. For these rea- 
sons we develop in this paper a new set of spectral indicators specif- 
ically designed to quantify the recent star formation of galaxies. 
These new indicators are based on a Principal Component Analysis 
(PCA) of the spectral region 3750-4150A of a se t of model galax- 
ies created from the Bru zual & Charlotl ( 120031 hereafter BC03) 
GALAXEV spectral synthesis code, a proportion of which have 
undergone stochastic starbursts. PCA is traditionally used to iden- 
tify correlations and variance in datasets and has often been a pplied 
to galaxy spectr a ( Connol ly et alj 1 19951; Id azebroo k et al] 1 19981 : 
iMadgwick et al JI2002L 120031: lYip et al.ll2004l:lFerreras et alfcoOoT) . 
By applying PCA to spectral data, collections of correlated fea- 
tures, such as the Balmer absorption lines, are easily extracted and 
quantified as a single parameter. Allowing all Balmer absorption 
lines to contribute to measuring the Balmer absorption strength 
greatly improves over the SNR obtained from measuring a single 
line alone. 

As we shall show, the first three principal components output 
from our analysis are easily interpreted as: 1) 4000A break strength 
(correlated with Balmer absorption line strength); 2) excess Balmer 
absorption; 3) excess Ca II (H&K) absorption. Previous work has 
made extensive use of D n (4000) and H<5 line strength to recover 
underlying physical parameters of galaxies in the Sloan Digital Sky 
Survey (SDSS) such as stellar mass and age (e.g. Kauffm ann et al] 
l2003bl) . and the first two principal components simply measure the 
two features already known to vary most in this spectral region. 
The third component, Ca II (H&K) strength, is used less often as a 
SFH indicator. However, it has been identified prev iously as a pow- 
erful diagnostic of A to F star fractions in galaxies dRosel 19851) and 
for breakin g the degeneracy between a ge and strength of a recent 
starburst jLeonardi & Rosell 19961 l2003h . 

In this paper we focus on the method and the qualitative trends 
of AGN properties with recent star formation history of their host 
bulges. In future papers we will use the new method to derive quan- 
titative parameters such as starburst strengths and ages. The outline 
of the paper is as follows. In Section [2] we introduce the SDSS 
galaxy sample to which we will be applying our new method. Read- 
ers with no further interest in the method may skip to Section|7] In 
Section [3] the new method is presented in detail, including details 
of the input models, the PCA method and the physical interpreta- 
tion of the resulting eigenspectra. The new indices are applied to 
the SDSS dataset in Section [4] and related to traditional indices to 
allow an easy reference point for interpretation. Duplicate observa- 
tions of SDSS galaxies are used to show the improvement of our 
new indices over the old ones. In Sectionf5]we compare BC03 stel- 
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lar population model tracks to our sample of SDSS galaxies and 
provide a simple toy model with which to visualise the evolution 
of the galaxies in the principal component planes. In Section[6]the 
BC03 models are used to investigate potential biases and degen- 
eracies involved in the interpretation of the new indices. In Sec- 
tion |7] we investigate trends of recent star formation history in the 
bulges of high surface mass density galaxies with global galaxy 
morphology, dust content and AGN strength. The contribution of 
AGN hosted by each class of bulge to the overall present day black 
hole accretion rate is presented. Our results are discussed in Section 

m 

Where necessary we assume the standard cosmology with 
Qm = 0.3, f^A = 0.7 and h = 0.7. Vacuum wavelengths are 
quoted throughout. 



This results in 33913 galaxies. A further 399 galaxies with ex- 
tended regions of bad pixels in the region of interest are removed 
from the sample leaving 33514 galaxies. Photometric properties 
used in this paper are taken directly from the SDSS catalogue; spec- 
troscopic continuum parameters used, such as H5a and D„(4000), 
are available through th e SDSS-MPA value added catalogue web- 
pagefl and described in lTremonti et al.l d2004l) . 

Because we are interested in the average properties of different 
types of galaxies within our sample, it is important to allow for the 
fact that the sample is magnitude- and not volume-limited. We do 
this by weighting each galaxy contributing to a mean or total quan- 
tity by 1/Vmax, where Vmax is the maximum volume in which 
the g alaxy may be observed in the spectroscopic survey dSchmidtl 
Il968t) . 



2 THE SDSS GALAXY SAMPLE 

The Sloan Digital Sky Survey (SDSS) spectroscopic galaxy cat- 
alogu e, with more than 500 000 spec tra in its fourth data release 
(DR4. [Adelman-McCarthv et al.l2 006). provides the ideal database 
for a first application and full testing of any new spectral analy- 
sis method. We concentrate on the bulges of low redshift galax- 
ies. With 3" diameter fibres, the SDSS spectra probe only the cen- 
tral few kpc of low redshift galaxies: 0.6 - 4kpc diameters for 
0.01 < z < 0.07. We use this "fibre aperture effect" to our ad- 
vantage to allow us to study the recent star formation history of 
galaxy bulges close to the central active galactic nucleus (AGN) 
that is present in many of these systems. 

Our sample is selected to be spectroscopically confirmed 
galaxies in the redshift range 0.01 < z < 0.07. The upper red- 
shift limit of 0.07 fits well with the characte ristic radius of bulge - 
dominated galaxies in the SDSS of ~4.4 kpc dBernardi et ail2 003). 
We further restrict the galaxies to have stellar surface mass densi- 
ties (jttj = 0.5M* iZ /7r x R 5 o,z 2 ) greate r than 3 x 10 s M p kpc" 2 , 
where M* is stellar mass measured by Kauffmann et al. (2003c) 
and Rso, z is z-band half-light Petrosian radius. Below this surface 
density, the disk dominates the light and the measured velocity dis- 
persion within the fibre aperture does not relate to the central veloc- 
ity dispersion of the stars around the black hole and thus cann ot be 
used to determine black hole mass dKauffmann e7aill2003ch . We 
expect this cut to have minimal impact on our final analysis of total 
[O III] luminosities of AGN hosted by each type of stellar popu- 
lation, as low surface density systems contribute very little to the 
volume-averaged inte grals of black hole mass and accretion rates 
( Heck man et al.l2 004). A lower limit on velocity dispersion is set at 
70 km s _1 , because the spectral resolution of the SDSS means that 
reliable velocity dispersions cannot be obtained for lower values. 
70 km s" 1 corresponds to a black hole mass of 10 6 - 3 M using the 
observed co rrelation between bul ge velocity dispersion and black 
hole mass of lTremaine et alj d2002l) . 

To summarise, our sample contains SDSS DR4 main sample 
galaxies that match the following criteria: 

• Spectroscopically classified as a galaxy 

• With matches in the photometric catalogue 

• Zwarning flag = 

• Redshift 0.01 < z < 0.07 

• Stellar surface den sity /i» > 3 x 10 s M kpc -2 (M, from 
iKauffmann et alj|2003bh 

• Velocity dispersion > 70 km s -1 and velocity dispersion mea- 
sured at > 3<t significance 

• Spectral SNR in the g-band SNR g > 8 



2.1 Emission line analysis of SDSS spectra 

We use the Baldwin e t al.l dl98ll BPT) method to discriminate be- 
tween narrow emission lines that are primarily caused by ongoing 
star formation or a central AGN, using the flux ratios [N Il]/Ha 
and [O IIl]/H/3. The sample is divided into four primary sub- 
samples: 'pure AGN' (3165), AGN including composite objects 
(AGN+composite, 11751), star forming (SF, 6357) and unclassi- 
fied objects (unclass, 15406). The emission line ratios of pure- AGN 
place t hem above the stringent theoretical criterion of lKewlev et al.l 
d200lh : 

log([0 III]/H/3) > 0.61/log([N II] /Ha) - 0.47 + 1.19 (1) 

with all 4 lines detected at > 3a or, if H/3 is too weak to reach 
the requisite SNR, the [N Il]/Ha flux ratio is greater than 0.2. 
The AGN+composite' sample has emission lines that lie above the 
observa tionally determined demarcation line of IKauffmann et all 
d2003iJ) : 

log([OIII]/H/3) > 0.61/log([NII]/Ha) - 0.05 + 1.3 (2) 

or have [N Il]/Ha flux ratio greater than —0.2. Non-AGN have lines 
that are dominated by star formation. The remainder are 'unclas- 
sified', primarily because of non-existent or very weak emission 
lines. 

The emission line measurements a re provided on the SDSS- 
MPA webpages and details are given in Brinchmann et al. 
errors have been rescaled according to the information provided 
on the website. Briefly, spectral synthesis models are used to fit 
the stellar continuum, including absorption features, and emission 
lines are measured from the residual of the data and model fit, thus 
contamination of the nebular emission lines by stellar absorption is 
accounted for. 



2.1.1 Correction for dust attenuation 

Dust attenuation of the emission lines is corrected for when Ha and 
H/3 are measured with SNR of > 3a and the flux in the H/3 line is 
greater than 4 x 10 _16 erg s _1 cm -2 , using the Balmer decrement 
method. The latter cut on H/3 flux was placed after investigation of 
objects with very high balmer decrements, which were often found 
to have apparently underestima ted errors on H/3 . We assume an 
intrinsic Case B ratio of 2.87 dOsterbrock51l989f) . We note that a 
slightly higher ratio of ~ 3 is generally accepted to apply in AGN, 

1 http://www.mpa-garching.mpg.de/SDSS/ 
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and this will cause a small systematic overestimate of our dust at- 
tenuations where emission lines are dominated by AGN (of ~ 15% 
at [O III]). However, separating the relative contributions of star for- 
mation and AGN to the lines is a difficult topic and we use a single 
ratio for simplicity. 

The form of the dust attenuation in star forming regions (birth 
clouds) and narrow line regions around AGN is not well known, 
but has a significant impact on our results. We therefore choose 
to present our results using two different attenuation corrections. 
The first is a single power-law of the form t\ oc A~ 0,7 which 
pro vides a good fit to th e UV to IR continua of starburst galax- 
ies JCharlot & Fail koOO). However, the attenuation caused by the 
birth clouds from which the emission lines originate will be more 
screen like (less grey) in form and this is not accounted for in the 
simple single-power law prescription. This results in dust-corrected 
[O III] emission line luminosities in average starforming galaxies 
that are a factor of several higher than those derived using a stan- 
dard Milky-Way like absorption curve. We therefore introduce a 
second prescription, designed specifically for the purposes of cor- 
recting emission lines: 



2.2 Stellar populations of broad line AGN 

Our galaxy sample has been specifically selected to omit broad line 
(Type 1) AGN. This is achieved by an automated spectroscopic 
classification algorithm within the standard SDSS pipeline. Uncov- 
ering the true stellar populations of broad line AGN is a substantial 
topic in its own right, due to the contamination of the stellar light by 
continuum light from the AGN. We have made no attempt in this 
paper to study broad line AGN, and refer t he in terested reader to 
IKauffmann etafl d2003j) . ISao et al.1 d2005h , and lYipetal] d2004h 
for studies of their stellar populations and contribution to the [O III] 
luminosity density in the SDSS. Within the unified model of AGN, 
Type 1 (broad line, unobscured) and Type 2 (narrow line, obscured) 
AGN are expected to differ only in the sightline by which they are 
viewed. In this scenario the stellar populations of Type 2 AGN are 
expected to be representative of those of Type 1 AGN with the 
same [O HI] luminositie s. This was the conclusion arrived at by 
IKauffmann et alj d2003 al) and, while the precise balance of stellar 
population classes as derived in this paper of Type 1 AGN relative 
to Type 2 AGN is obviously of great interest, it is beyond the scope 
of the present paper. 



where tv is the total effective optical depth in the 1/-band and n is 
the fraction of total ry caused by the ambient ISM. We set /i = 0.3 
based on observed relations between UV continuum slope and Ha 
to H/3 emission line ratios. Although the exponent in the first term 
is not well constrained by observations of external galaxies, —1.3 
lies between that observed in line -of-sight observation s to stars in 
the Milky Way, LMC and SMC dCharlot & Faifeood) . This new 
dust prescription thus accounts for the attenuation of the emission 
lines within the birth clouds (first term) and the attenuation as the 
light passes though the ISM [second term, see da Cunha et al. (in 
preparation) for further details]. 

Due primarily to insufficient SNR of the weaker H/3 emission 
line, it is only possible to correct 26% of our galaxies for dust at- 
tenuation of the emission lines. The majority of the objects without 
dust attenuation corrections are quiescent systems with little ongo- 
ing star formation and thus very weak emission lines; this accounts 
for 73% of the galaxies we are unable to calculate corrections for, 
using the classification of Section lTTI These old stellar populations 
are known to contain very low quantities of dust: we find they have 
mean z-band attenuations consi stent with zero (derived fro m model 
fits to the stellar continuum by Kauffmann et al. (2003b)). As the 
star formation rate of systems increases, the percentage that we can 
correct for dust attenuation rises, as does their mean dust content. 
For the galaxy-bulges with normal levels of ongoing star forma- 
tion (mean log specific star formation within the fibre of -10.26, 
see Section im ), 68% are corrected for dust attenuation and for the 
starbursting galaxies 98% are corrected. We therefore believe that 
the primary results of this paper are not biased by the objects for 
which dust attenuation correction of the emission lines is not pos- 
sible. However, as will be discussed later, there is a population of 
objects for which it is likely that we are unable to measure H/3 in a 
large fraction due to extreme dust attenuation. While this bias has 
been included in our discussions where appropriate, we note that 
these systems account for only ~1.5% of those for which a Balmer 
decrement is not obtained. 



3 METHOD: BUILDING THE NEW INDICES 

Our primary goal is to improve upon the traditional method of iden- 
tifying post- starburst galaxies through the strength of H<5 absorp- 
tion and [O II] emission. We therefore concentrate on the Balmer 
line region of the galaxy continuum, which, in the optical, contains 
the greatest amount of information on the relative fractions of O to 
A stars. By selecting a small wavelength region, we also minimise 
the effects of internal dust attenuation in the galaxies, and can eas- 
ily uncover and isolate discrepancies between models and data. 

In Figure Q] a selection of stellar spectra are plotted in the 
wavelength range 3750-4200A. The defining wavelength regions 
of two indices traditionally applied in this region are indicated and 
prominent absorption lines marked. With decreasing stellar tem- 
perature the 4000 A break strength [as measured by D n (4000)] in- 
creases and the H5A4102.9 absorption line first strengthens and 
then weakens. A strong UV continuum is evident in the hottest 
stars. Each stellar type has a characteristic main sequence lifetime, 
~0.5 Gyr and 3 Gyr for A and F stars respectively; it is these dif- 
ferent lifetimes that allow us to study the star formation histories of 
stellar populations. Ca II (H&K) strength increases rapidly through 
A stars to F stars, the ratio of Ca II (H)+He to Ca II (K) thus pro- 
vides the decisive fine age indica tor for young (< 1 Gyr) stellar pop- 
ulations dlxonardi & Rosdl 19961) . 

The main principle behind our method is to use all the spec- 
tral features in the wavelength region around the 4000A break, to 
quantify the young stellar content of the galaxy based on continuum 
shape and absorption line strengths. We achieve this by searching 
for patterns of pixels which vary most within a set of model spec- 
tra and, from these patterns, defining three new spectral indices. 
From previous work on galaxy spectra we expect the main varia- 
tion in this wavelength range to be the strength of the 4000A break 
which is well known to be a powerful stellar age indicator. The sec- 
ond variation is expected to be related to US absorption strength, 
which gives an additional constraint on the relative amount of any 
additional young stellar component. Although our original aim was 
simply to create a high SNR replacement for H<5, our procedure also 
identified Ca II as a clearly interpretable third axis of variation. 

The method is based on a principal component analysis 
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Figure 1. Example main sequence stellar spectra in the wavelength region 
of the 400()A break selected from the Pickles stellar library lPicklel fl998[) . 
Grey shaded areas indicate the blue and red waveleng th regions from which 
D n (4000) is calculated as defined by iBalogh et alj 1 19991); hashed areas 
indica te regions from which the US a index is calculated (Worfhey et al. 
[_1994|). The stars are ordered by temperature, from hot to cold (i.e. short- 
est to longest main sequence lifetime). The 4000A break strength increases 
monotonically with decreasing temperature, while H<5 equivalent width de- 
creases for the cooler stars, thus both provide powerful age indicators for 
young to intermediate age stellar populations. This paper works with the 
wavelength region 3750-4150A. 

(PCA), a standard multivariate analysis technique used in many 
quantitative science fields, and a mathematical introduction is in- 
cluded in Appendix [A] It is however important for the reader to 
have a basic visualisation of the process. A spectrum that contains 
M pixels of flux values is normally thought of in terms of a 1- 
dimensional array; it can alternatively be imagined as a single point 
in an M dimensional space. A collection of spectra then becomes 
a cloud of points in this space, rather than a 2D array of flux val- 
ues. PCA searches for the lines of greatest variance in the cloud 
of points representing the spectra, with each line described by a 
vector that is called a "Principal Component" or "eigenspectrum". 
The principal components are constrained to be orthogonal to one 
another and placed in order of the amount of variance within the 
dataset that they account for. The principal components form a new 
basis, rotated from the original, upon which the galaxy spectra are 
projected to obtain the amplitude of the principal components con- 
tained within eac 

2 The constraint of orthogonality means that PCA must be applied with 
great care when analysing galaxy spectra, where the varied physics involved 



3.1 Input spectral dataset 

There are several options for creating an input dataset for spectral 
PCA analyses. Arguably one of the more favourable is to use the 
data themselves, thus avoiding any bias in the final results caused 
by the limited physics contained within models. However, while 
one of the oft quoted 'benefits' of PCA is its ability to identify 
correlations and variations in datasets without the need for calibra- 
tion with models and theory, any interpretation of observational re- 
sults ultimately depends upon some link between theory and data. 
Although spectral synthesis models contain some limitations that 
must be kept in mind by observers and modellers alike, they cur- 
rently provide the framework upon which much of our understand- 
ing of galaxy evolution is based. 

We must therefore choose at what stage the models and data 
are to be compared. For our specific purpose of deriving new high 
SNR indices to describe the stellar continuum and absorption lines 
of galaxies, there are several disadvantages to preforming the PCA 
on real galaxy spectra. First and foremost is the contamination of 
the data by emission line infilling of the Balmer series. As nebular 
emission varies greatly between galaxies, any PCA on a represen- 
tative populat ion of galaxy spectra primarily describe s the emission 
line strength dMadgwick et alJl2003l ; lYip et al.ll2004h . which quan- 
tifies current star formation rate and/or AGN strength alone. Due to 
potential AGN contamination, we are particularly interested in de- 
riving star formation indices that are independant of emission lines 
and it was found not to be possible to satisfactorily remove or mask 
the Balmer emission lines that occur in the center of the absorption 
features in which we are interested. Secondly, we intend our new 
method to be applicable to many datasets, allowing a direct com- 
parison between datasets just as with traditional indices. Were the 
PCA to be performed on each dataset individually, different prin- 
cipal components would be found and a comparison between the 
datasets becomes non-trivial. 

A second option, and the one chosen in this paper, is to 
introduce the spectral synthesis models from the start by creat- 
ing a model galaxy population. The set of models, based on the 
BC03 stellar population synth esis code, is simila r to that used in 
iKauffmann etafl J2003bt) and ISalim et al.1 J2005t) . although with 
more restricted parameter ranges. 6629 stellar populations are cre- 
ated obeying the following criteria: 

• Time of galaxy formation is distributed uniformly between 
and 5.7 Gyr after the Big Bang, where the age of the Universe is 
assumed to be 13.7 Gyr. 

• The underlying star formation rate is characterised by an ex- 
ponential decay rate distributed uniformly between 0.7 < 7 = 
1/t < lGyr- 1 

• Metallicity is distributed linearly in solar units between 0.5 < 

Z < 2Z Q 

No metallicity evolution is included in the model galaxies. Top 
hat bursts are then superposed "stochastically" on these underlying 
SFHs with equal probability at all times. These are parameterised 
in terms of j 'burst, the fraction of stellar mass formed in bursts in 
the past 2 Gyr to that formed by the underlying SFH during the 
total evolution of the galaxy, and the length of the burst, fburst 

in the integrated light of spectra leads to complicated patterns and correla- 
tions in spectral features. The wavelength region chosen in this study is 
exceptional in containing a few strong features which can be well described 
by an orthogonal basis set, thus allowing for simple interpretation of the 
resulting principal components. 
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is distributed logarithmically between 0.0 and 0.1 and the length 
of the burst is distributed uniformly between 0.03 and 0.3 Gyr. Fi- 
nally, the fraction of galaxies with ongoing starbursts is reduced to 
25% of those initially created leaving 6473 model galaxies, 29% of 
which have experienced a burst in the past 2 Gyr. The number of 
galaxies with ongoing starbursts was reduced because the starburst 
features were found to overly dominate the principal components 
for our requirements. 

It is important to be clear about the effect of the input sample 
on any PCA. PCA is a non-robust variance based technique: chang- 
ing the distribution of input galaxies greatly affects the output prin- 
cipal components. Additionally, PCA is affected by outliers in the 
dataset, and if principal components representative of the majority 
of the dataset are required then such outliers must be removed. Al- 
though we set out to recreate a population of early-type galaxies 
with varying degrees of recent "bursty" SFHs, the details of the in- 
put sample were selected through trial and error, with the aim of 
recovering principal components with good power to distinguish 
the age and strength of recent bursts. Once the components are cre- 
ated, the precise details of the input set become irrelavant, simply 
affecting the ability of the new indices to measure the physical pa- 
rameters we are interested in. We note that a more elegant way to 
achieve similar results may be th rough application of a method sim- 
ilar to the MOPED algorithm of iHeavens et al] 42000), to tailor the 
principal components from the outset to the parameters we wish 
to extract from the data. This algorithm has already been used to 
derive global s tar formation histo ries and metallicities from SDSS 
galaxy spectra dPanter et all2003h . At this stage however we prefer 
the more transparent approach of PCA, through which it is rela- 
tivley straightforward to understand our results and isolate imper- 
fections in the theoretical models. A PCA on the SDSS data was 
also performed and, with suitable masking of emission lines, found 
to produce qualitatively similar first and second principal compo- 
nents. 



3.2 Creating the "eigenspectra" 

In this and the following sections we explicitly refer to the SDSS 
dataset which is studied in this paper (Section [2j, however the 
procedure may be adapted to any other dataset. The 3750-4 150A 
wavelength range is extracted from the BC03 model galaxy spectra, 
the spectra are shifted to vacuum wavelengths, rebinned to match 
the logarithmic SDSS wavelength binning, and convolved to have 
velocity dispersions equal to 150km s -1 , the mean value of the 
dataset to which the new indices will be applied in this paper. The 
effect of velocity dispersion on the derived components will be dis- 
cussed further in Section [6] at this stage it suffices to note that 
the B aimer series lines in which we are particularly interested are 
strongly pressure broadened in main sequence dwarf stars, and thus 
in young and intermediate age stellar populations these particular 
line widths are relatively unaffected by the velocity dispersion of 
the galaxy. 

Each model spectrum is normalised to have total flux of unity, 
the mean spectrum of the input dataset is calculated and sub- 
tractecSfl, and the PCA is run to produce the principal components, 



3 The subtraction of the mean spectrum from a dataset prior to a principal 
component analysis is an important detail, acting in exactly the same way as 
removing the mean before calculating the scatter in a set of numbers. Fail- 
ure to do so imposes an approximate mean spectrum as the first component, 
and thus an unnecessary constraint on the remaining principal components 
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Figure 2. The eigenspectra: our new indices describe the amount of each 
eigenspectrum in a galaxy spectrum. The top spectrum is the "mean" spec- 
tram of the input dataset and the eigenspectra are ordered by the amount of 
variance within the dataset that they account for. The main absorption fea- 
tures in this wavelength range, the Balmer and Ca II lines, are marked. The 
first eigenspectrum measures the 4000A break strength and anti-correlated 
Balmer absorption line strength. The second eigenspectrum measures a de- 
crease in Balmer absorption and corresponding increase in the strong blue 
continuum of hot stars. The third eigenspectrum primarily measures any 
additional Ca II (H&K) absorption. 

termed eigenspectra from now on, which are to define our new 
spectral indices. 

3.3 The new indices 

Figure|2]presents the mean spectrum and first four eigenspectra of 
our input model galaxies. The mean spectrum is typical for that of 
an quiescent galaxy (see Figure [Tot; the first component shows the 
4000A break and corresponding inverse-correlation with Balmer 
line strength; the second component shows the Balmer series and 
inverse-correlation with the blue continuum shape seen in O and 
B type stars; the third component is primarily dominated by the 
Ca II (H&K) absorption lines, with Ca II (H) decorrelated from the 
broader He at the same wavelength; the fourth component contains 
further metallicity information, however, by this stage it becomes 
more difficult to find simple interpretations for the eigenspectra. 

to lie orthogonal to the mean spectrum. Consequently, they will not prop- 
erly describe the true lines of variance in the dataset. Note the purposeful 
difference in terminology between the mean spectrum of the dataset and the 
normalisation of individual spectra. 
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Figure 3. A galaxy with a strong A/F star component (bottom, black) is 
built up of the mean spectrum (second from bottom), plus given amounts 
(amplitudes) of the first three eigenspectra. Note that these are offset in 
the figure by an arbitrary flux for clarity. The amplitudes are the new in- 
dices. The resulting spectrum reconstruction (addition of the mean and the 
3 eigenspectra multiplied by the amplitudes) is overplotted in red. 



4 APPLICATION TO REAL DATA 

The basic procedure for measuring the indices in real data is 
straightforward: the galaxy spectrum is corrected for foreground 
Galactic extinction, moved to the rest frame, the mean spectrum 
of the input (i.e. BC03 model) sample is subtracted, and the resid- 
ual spectrum is projected onto the eigenspectra (i.e. the inner, or 
dot, product is calculated, see Appendix[A]l- The resulting principal 
component amplitudes represent the amount of each eigenspectrum 
present in the galaxy spectrum; these are our new indices. 

Often there are regions of a real spectrum which we do not 
wish to include in the analysis, for example, regions with bad sky 
line subtraction or regions affected by nebular emission lines. We 
mask these pixels with a process called Gappy-PCA, introdu ced to 
spectral analysis in astronomy bv lConnollv & Szalavl dl999l) . This 
allows pixels to contribute zero weight during the calculation of the 
principal component amplitudes, although at the price that the in- 
dependence of the individual components is degraded. In the SDSS 
dataset bad pixels are identified as those with error set to zero. Ad- 
ditionally we mask 5A either side of the Ne IIIA3870 line in spectra 
in which this line is detected at greater than 1.5a, and the centers 
of the US through H10 lines in spectra with equi valent width of US 
in emi ssion measured to be greater than lA by iBrinchmann et"al] 
d2004l) . The effect of emission line infilling is discussed further in 
Section Wl2\ Because the Balmer emission lines sit at the center of 
absorption lines, the resulting flux normalisation is systematically 
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Figure 4. The relation of the new spectral indices to traditional, well stud- 
ied, indices for SDSS galaxies. The dataset, described in Section|2] contains 
low redshift, high stellar surface mass density SDSS galaxies. For these 
plots only, due to the large errors on the measurement of H<5a in noisy 
spectra, galaxy spectra are required to have a per pixel SNR in the g-band 
greater than 20. 



biased by the masking procedure. A new Gappy-PCA algorithm 
has been developed by one of us (GL) which allows the flux nor- 
malisation as an additional free parameter during the calculation of 
the principal component^. This new procedure was applied to all 
spectra. 

Figure [3] shows how a galaxy spectrum with a dominant A 
star population may be constructed from the mean spectrum and 
differing amounts (amplitudes) of the first three eigenspectrum. 

It is very important to compare our new indices to the tra- 
ditional indices we wish to replace in order to quickly develop a 
full understanding of the information they contain. Figure [4] com- 
pares the amplitudes of the first two components, PCI and PC2, 
to D„ (4000) (4000A break strength) and U8 A (Lick index primar- 
ily describing US equivalent width) for the SDSS spectra presented 
in Section|2] We can see that PCI is equivalent to D n (4000) with a 
slight curve in the relation in the sense that the PC 1 axis is stretched 



4 The IDL code and documentation for the NormGappy-PCA 
procedure will be available on acceptance of the paper at 
http://w w w. voservices .net/spectrum/ 
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slightly for later type galaxies compared to early type galaxies. 
— PC1+PC2 is equivalent to H<5a for late type galaxies, but satu- 
rates in the early type galaxies. Note that PC2 alone is not equiva- 
lent to H5a, but to excess H5a over that expected for the galaxy's 
D n (4000) value. 

4.1 Improvement over old indices 

The simplest way to show the improvement of our new Balmer se- 
ries index over the traditional H<5a measure, and therefore the ben- 
efit of developing this relatively complex technique, is to compare 
the indices measured from duplicate observations of the same ob- 
ject. Many such observations exist within the SDSS survey, from 
entire plates which were re-observed and also objects observed 
on two different plates, due to plate overlaps. A list of duplicate 
galaxies in SDSS DR4 has been compiled and is available on the 
SDSS-MPA web pages. Figures [5] and [6] compare the scatter be- 
tween duplicate observations of D n (4000) and PCI, and H<5a and 
— PC1+PC2, as a function of observed frame (/-band SNR per pixel 
in both spectra. The improvement is clear, particularly for H<5a, al- 
though it should be recognised that — PC1+PC2 is not a measure of 
absorption line strength alone: information contained in continuum 
shape is also used. 



5 COMPARING MODELS AND DATA 

In the top-left panel of Figure|7]we plot the main diagnostic plot to 
be used throughout the remainder of this paper, PCI vs. PC2. This 
is our new equivalent of D n (4000) vs. H5a, although should be 
thought of as 4000A break strength vs. excess Balmer line strength, 
i.e. the inverse-correlation between D n (4000) and H<5a has already 
been accounted for in PCI. In the top-right panel of Figure [7] we 
plot PCI vs. PC3, the index which measures excess Ca II absorp- 
tion. For comparison with previous work, the bottom-left-hand plot 
in Figure[7]shows D n (4000) vs. H<5a for a high SNR subsample of 
the galaxies. In all panels our SDSS galaxy-bulge sample is plotted 
in greyscale; in regions of low number density, individual galax- 
ies are plotted as points. The quiescent cloud (red sequence) with 
large 4000A break strength is clear in all plots, the star forming 
sequence extends to lower values of PCI. The tail to the bottom 
left of PC 1/2, with small 4000A break and very weak Balmer lines, 
is composed of galaxy bulges with stellar populations dominated 
by O or B star spectra, i.e. experiencing a starburst. The 'hump' in 
PC 1/2 over the starforming sequence are stellar populations with 
stronger than average Balmer absorption lines, i.e. candidates for 
populations which experienced a strong burst of star formation in 
the past. 

The overplotted tracks follow the evolution of a 0.03 Gyr 
tophat model starburst, superimposed on a composite quiescent 
galaxy, created from galaxies in our quiescent class (Section [7. lb . 
This approach isolates the small discrepancies between the model 
and SDSS spectra for old populations (see Section [6~Tl (. The burst 
strength is characterised by the mass of stars formed during the 
burst, compared to the mean mass of the galaxies wihtin the com- 
posite spectrum; we plot four burst tracks of strengths 0.5%, 1%, 
3% and 20%. The stars indicate times after the onset of the burst of 
0.0001, 0.01, 0.2, 0.5, 1.0 and 1.5 Gyr. 

Looking at the starburst tracks in PC 1/2, we can see the fa- 
miliar degeneracy between the age and the strength of the burst. 
The "post-starburst" objects in the 'hump' could be galaxies that 
have undergone a massive starburst a relatively long time ago, or a 



smaller starburst more recently. This degeneracy is also present in 
D n (4000) vs. H<5a- PC3 allows the degeneracy to be broken how- 
ever, by making use of the rapidly changing Ca II line strength be- 
tween A and F stars. 

As discussed in Section Q] it remains unclear as to whether 
galaxies with strong Balmer absorption lines have indeed experi- 
enced a recent burst of star formation in the past (post-starburst), or 
whether the star formation has simply been shut-off (truncation), 
for example by interaction between the intracluster medium and 
the galaxy's interstellar medium as it enters a cluster environment 
or by feedback associated with the rapid growth of a black hole. For 
a single galaxy, distinguishing these scenarios from our stellar con- 
tinuum measures alone is difficult; shutting off the star formation 
in the continous star formation histories, causes galaxies to move 
upwards in PC2. However, taking the population of galaxy-bulges 
as a whole provides additional constraints. In particular, the trunca- 
tion model does not explain the relatively large number of bulges 
that are undergoing starbursts, while these are the natural progeni- 
tors of the post starbursts in our sample. We invoke Occam's Razor 
and will not discuss such truncation models further in this paper. 



6 POTENTIAL BIASES AND COMPLICATIONS 

In Figure[8]we investigate the effect of several potential systematic 
effects on the starburst evolution tracks: a change in stellar library, 
dust, changing the underlying old population, burst metallicity and 
an overall change in velocity dispersion. 

Stellar libraries: Figure [8ji compares the results us- 
ing two different observational stel lar libraries input int o the 
GALAX EV code of BC03, Stelib Ihe Borgne etal] 120031) and 
MILES JSanchez-Blazquez et al. 2006). Note that the implimen- 
tation of the MILES library in the GALAXEV code is preliminary. 
Very little difference is observed except for a small offset in the 
hottest stars. 

Dust: Figure [8}) shows the effects of dust in the burst stel- 
lar popul ation. The dust prescr iption implimented is the two phase 
model o f lCharlotfe Fall (2000) in which stars older than 10 years 
are extincted at a level of 30% that of the young stars. We com- 
pare dust free models with dusty model galaxies in which young 
stars suffer one magnitude of attenuation in the V-band {Av = 1). 
The presence of dust causes an apparent small reduction in burst 
strength. 

Underlying old population: The biggest effect on PC3 is 
caused by a change in mass of the underlying old stellar popula- 
tion. As young stellar populations have only weak Ca II lines, the 
old stellar population greatly affects their total strength. In Fig- 
ure [8J: a high and low stellar mass composite has been created 
[log(M„/M )< 10.25 or > 1 1.0]. The increase in PC 3 with stel- 
lar mass may be due to several effects: the high mass galaxies 
have higher velocity dispersions (see e), higher metallicities and 
are more likely to be Q-enhanced. 

Metallicity: Figure (8ji compares starburst tracks of two dif- 
ferent metallicities, Z — OAZq and Z — 2.5 Zq. The same 
quiescent composite has been used. In PC 1/2 the lower metallicity 
causes an apparent reduction in burst strength. In PC3 we see again 
that Ca II is effected by metallicity, but the effect only becomes 
visible in intermediate age populations where Ca II is stronger. 

Velocity Dispersion: Figure[8]s compares models with veloc- 
ity dispersions of 112 and 225kms~ 1 . Recall that the eigenspec- 
tra are created with velocity dispersions of 150kms _1 . To isolate 
the velocity dispersion effect, the same low stellar mass quiescent 
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Figure 5. Comparison of the scatter between measurements of D n (4000) (top) and PCI (bottom) in SDSS galaxies which have been observed twice. The 
sample is split according to per pixel SNR in the g-band as indicated in the top left. Median lcr statistical errors on the quantities for each sample are indicated 
in the upper left. 
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Figure 6. Comparison of the scatter between two measurements of H<5a (top) and — PC1+PC2 (bottom) in SDSS galaxies which have been observed twice. 
The sample is split according to per pixel SNR in the g-band as indicated in the top left. Median lcr statistical errors on the quantities for each sample are 
indicated in the upper left. 
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Figure 7. In grayscale: the joint distributions of PCI vs. PC2 (top left) and PCI vs. PC3 (top right) and D n (4000) vs. HS\ for the spectra of high stellar surface 
mass density, low-z SDSS galaxies. The greyscale indicates logarithmic number of objects. In regions of low number density, individual points are plotted in 
black. The galaxies in the D„(4000)/H<5a plot are additionally constrained to have g-band SNR/pixel greater than 20. PCI is equivalent to D n (4000), PC2 
measures excess Balmer line strength and PC3 measures excess Ca II line strength. Overplotted as tracks are model Bruzual & Chariot (2003) tophat starbursts 
of 0.03 Gyr duration, superimposed on a composite early type galaxy spectrum. Asterixes indicate times after the initial starburst of 0.001,0.01,0.1,0.5,1.0 and 
1.5 Gyr. With a mean stellar mass of 4.3el0 Mq for the early type galaxies in the composite, burst mass fractions are 0.5% (orange), 1% (red) and 3% (blue) 
and 20% (purple). 



population has been used for both, and convolved up to the higher 
velocity dispersion for the dotted tracks. A small effect is seen on 
all components. 

6.1 Offset in indices between models and data 

We note that there is an offset between the exponential star forma- 
tion history BC03 model galaxies and the SDSS data in D„(4000) 
vs. US a, where none w as previously apparent (see Figure 3 in 
iKauffmann et alj |2003b). This mismatch results from spectropho- 
tometric calibration differences bet ween the SDSS dataset a nd stars 
in the Stelib stellar spectral library dLe Borgne etafll2003l) . which 
underpins the BC03 models. The difference between earlier work 
and the current analysis is primarily in the spectrophotometric cal- 
ibration of the SDSS spectra between DR1 and DR2 (Abazajian 
et al., 2004). A new version of m odels is under construction us - 
ing the new MILES stellar library dSanchez- Blaz quez et al] |2006). 
which solves this problem (G. Bruzual, private communication). 
The magnitude of the effect is ^0. 1 mag in Dn(4000) and lA in 
H5a (see also the comparison in Sanchez-Blazque z et alj200r3) . 

The effect on the principal component amplitudes of a mis- 
match in 4000A break strength between models and data is not 



immediately intuitive. PC2 is in fact worst affected, because the 
primary variance in the data is the shape of the continuum and it 
is this that PCA concentrates on fitting first. Because of the anti- 
correlation between 4000A break and Balmer line strength, the best 
fitting first component then has too strong Balmer lines and PC2 
must be used to counterbalance some of this. An additional effect, 
visible in the reconstructions of some starforming spectra, is a sub- 
sequent mismatch in the far blue continuum shape. An offset in PC3 
is expected, due to the incorrect amount of Ca II being introduced 
by the other two components. 



We expect the current problem with the spectral synthesis 
models to be solved in the very near future, however, it is clear 
that systematic errors on derived parameters from spectral synthesis 
models may be significantly larger than the statistical errors usually 
quoted. In this paper we are uneffected by such problems, as we do 
not attempt to derive full quantitative solutions of burst parameters 
for our galaxies. In future papers it will be important to consider 
comparison between different spectral sy nthesis models, such as 
those based on synthetic stellar libraries (Gonzalez Delgad o et al.l 
120051) . 
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6.2 Emission line infilling 

One of the most difficult parts of fitting models to the stellar compo- 
nent of a galaxy spectrum is the contamination of the data spectrum 
by non-stellar light. Nebular emission lines are one such contam- 



inant that we must be particularly cautious of, as they affect the 
B aimer absorption lines in which we are particularly interested. 

We selected a subsample of objects with H<5 emission and 
compared the three principal component amplitudes obtained with 
and without masking the centers of the H<5 through H10 absorption 
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lines. A clear emission line equivalent width limit was found below 
which the amplitudes remained unbiased. Above this limit (given 
in Section @ we mask the centers of the absorption lines before 
calculating the amplitudes. Additionally, the flux normalisation of 
the spectra is crucial, as the emission lines sit preferentially in the 
centers of absorption lines, it is thus preferable to leave normali- 
sation as a free parameter when calculating the component ampli- 
tudes (Section[4]l. 

We note that for strongly star-forming systems, the second 
principal component primarily fits the overall shape of the contin- 
uum i.e. the strong blue emission from the hot stars, rather than the 
strength of the Balmer absorption features. 



1 7 




6.3 Contamination by AGN continuum light 



There is a population of objects which scatter below the star- 
forming branch of the PC 1/2 plane, with weak Balmer absorp- 
tion lines for their 4000A break strength. Further investigation of 
these spectra show them to be objects with apparently old stel- 
lar populations, but some excess blue continua. A proportion of 
these are weak Type 1 AGN in which some broad line component 
is visible in the stronger Balmer emission lines. Many others are 
found to have radio (FIRST) and/or soft X -ray (RASS) counter - 
parts, and are most likely weak BL Lacs (e.g. ICollinge et alj2005l) . 
To the PCA analysis the featureless blue AGN continuum looks 
very similar to the blue continuum of young O and B stars, weak- 
enin g the Balmer se ries and 4000A break in a similar way (see 
also lHao et alj|2005h . A stellar population study of these objects 
from their optical spectra alone is beyond the scope of this ini- 
tial paper, and we simply treat them as a separate class of objects. 
Dust- or electron-scattered continuum light from the hidden AGN 
in Type 2 AGN (the AGN type that dominates our sample) has been 
shown contribute ne glibly to the blue continuum light in all but the 
most extreme AGN fSchmitt et al.ll 19991: iKauffmann et alj [2003a; 
IZakamska etal1l2006t) . 



7 RESULTS: THE RECENT SFH OF LOCAL GALAXY 
BULGES 

We now turn to the application of our method presented in Sections 
[3] and [4] to the SDSS dataset presented in Section [2] We focus on 
the AGN properties as a function of recent star formation history 
in the bulge, and in particular on the properties of the bulges with 
excess Balmer absorption which have undergone a more unusual 
recent star formation history. 

The tight correlation observed between supermassive 
black hole mass and ve l ocity dispersion in g alactic bulges 
dFerrarese & Merrittl |2000| : iGebhardt et all l2000l) is well ex- 
plained by theoretical models in which black hole and spheroid 



Haeh nelt & Kauffmanrj|200(j| ; lAdams et all 
2004). Observationally there is much 



growt h are linked (e. 
1200 ll ; iGranato et ail 
evidence for the coincidence of recent or ongoi ng star forma- 
tion acti vity with AGN activity in Se yf ert 2s dHeckman et al. 



1997|: IGonzalez Delgado & Heckmanl Il999l: ISchmitt et al 
iGonzal 



Cid Fernandes et alj|2004l). type 2 Q SC 
LINERS dCid Fernandes eTall 12003) i 



19991; IGonzalez Delgado et al.1 l200lT ; iRaimann et alj 120031 ; 



SOs dZakamska etaUl2006h . 



and powerful FRII radio 
galaxies dTadhunter et alj|2005l) . This strongly suggests a positive 
link between the accretion of matter onto the black hole and an 
increase in bulge size through starbursts. However, the relationship 
is clearly complicated, with the fraction of objects identified with 



Figure 9. Defining classes of stellar population for galaxy bulges. The joint 
distribution of PCI vs. PC2 with galaxy bulges classified as quiescent (red), 
green-valley (green), star-forming (cyan), starburst (blue) and post-starburst 
(brown) and other (black) according to their position in the PC 1/2 plane. We 
caution that there are no distinct breaks in the distribution, and each class 
is a continuation of the others. To guide the eye in comparison with subse- 
quent figures, a single weak instantaneous burst track of 1 % burst fraction 
is overplotted. 



young stellar populations apparently varying with AGN type and 
strength. Uncovering the stellar populations of Type I AGN hosts 
remains very technically challenging, with limited agreement 
between different studies ( Canalizo & Stockton 2000l ; iNolan et al.1 
l200ll : ICanalizo et alj|2006l : lYip et alj2004h . 

The majority of present day black hole growth is found to 
occur durin g relatively high accre tion rate phases on to low mass 
black holes dHeckman et al. 2004). We may expect to find that re- 
cent star formation correlates with accretion rate, with newly avail- 
able gas resevoirs at the center of the galaxy either concurrently 
forming stars and feeding the black hole, or forming stars which 
subsequently fuel the black hole. Large homogeneous samples are 
required for such an analysis, such as those available with spectro- 
scopic studies such as the SDSS. In these studies, less detailed data 
is compensated for by being able to study the average prope rties 
of hundreds of thousands of objects. Kauffmann et al. ( 20033) de- 
tect significant trends with AGN strength in the stellar populations 
of galaxies hosting obscured AGN, finding that AGN with higher 
[O III] luminosity are contained in hosts with younger mean stellar 
ages and with a higher likelihood to have undergone a recent star- 
burst. In this section we follow on from this work using our new, 
higher SNR stellar population indicators to quantify more precisely 
the recent star formation of central galaxy bulges with and without 
the presence of an AGN. We investigate trends with [O III] lumi- 
nosity, dust content and irregularity of morphology. 



7.1 Stellar population classes 

Our sample of galaxies is described in detail in Section[2] In Figure 
[9] we have split the sample into six different classes according to 
their position in the PC 1/2 plane: quiescent, "green- valley", star- 
forming, starburst and post-starburs{j. Except for the post-starburst 



5 The sixth class contains objects that appear to be contaminated by scat- 
tered AGN light (see Section 16.31 . As the line parameters derived from 
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Table 1. The number of objects in each class, classified by narrow emission line ratios as AGN/composite, star-forming or unclassifiable (due to non-existent 
or weak emission lines). The values in brackets are after correcting the numbers for su rvey volume effect s using the 1/V ma x method (Section [2). The final 
column gives the mean of the logarithmic specific star formation rate (SSFR) derived by Brinchmann et al. 1 2004) for each class. 



Class 


AGN+comp 


SF 


unclass 


Total 


(log(SSFR)) 


Quiescent 


5226 (6128.) 


54 ( 63.) 


13178 ( 15330.) 


18458 (21523.) 


-11.77 


Green Valley 


2297 (2601.) 


180(209.) 


953 ( 1088.) 


3430 ( 3900.) 


-11.15 


Star forming 


3611 (4090.) 


5056 ( 5729.) 


1027 ( 1191.) 


9694 ( 11010.) 


-10.26 


Starburst 


58 ( 66.) 


694 ( 770.) 


6(7.) 


758 ( 844.) 


-9.68 


Post-starburst 


364(415.) 


178(215.) 


92 ( 104.) 


634 ( 735.) 


-10.41 


Other 


195 ( 233.) 


195 ( 235.) 


150 ( 190.) 


540 ( 659.) 


-10.67 


total 


11751 ( 13537.) 


6357 ( 7222.) 


15406 ( 17913.) 


33514(38673.) 


-11.19 



and quiescent populations, the exact positioning of the class divi- 
sions is fairly arbitrary as there are no well defined minima, as seen, 
for example, in the bimodality of the galaxy population as a whole. 
Table Q] presents the numbers of galaxies within each class, split 
according to their emission line ratios as AGN or star-formation 
dominated. To give some physical meaning to each class, the final 
column gives the mean l ogarithmic specific star f ormation rate of 
each class, as derived bv lBrinchmann etHll d2004l) . see e.g Fig. 25 
of that paper for a comparison with all star-forming galaxies. We 
note that for AGN, these values are derived from D„(4000) rather 
than the emission lines. FigurefTOlpresents composite spectra of all 
galaxies in each class. Note the strong emission lines present in the 
post-starburst class: selection of such objects based on the lack of 
[O II] or Ha emission lines will select only a very small subsample 
of those galaxies found to have excess A star popula tions in this 
paper. We will return to this point in Section [83l Brinc hmann et alj 
(120041) showed that D n (4000) correlates well with specific SFR 
within the fibre aperture derived from emission line strengths, and 
Figure [4] shows that PCI is equivalent to D n (4000), therefore, ex- 
cept for the post-starburst population, our different classes simply 
describe a sequence in specific SFR. 

In our simple instantaneous burst model, the galaxy bulge 
leaves the quiescent (red sequence) or green- valley populations as 
soon as a central starburst occurs and, if the starburst is strong 
enough, almost instantaneously appears in the bottom left of the 
diagram. This stellar population ages, without new stars being 
formed, and moves along a track similar to the ones indicated in 
Figure [7] with the precise track depending primarily on the frac- 
tion of stars formed in the burst. Before a starburst of more than 
~1% mass fraction returns into the green valley, or red sequence, 
it passes through the "post-starburst" phase where excess Balmer 
absorption is evident. Within this model, some of the bulges classi- 
fied as "star-forming" may simply be undergoing smaller, or more 
extended bursts of star formation and are not necessarily equivalent 
to the star-forming branch of the general galaxy population which 
are believed to have experienced almost constant star formation for 
most of their lives. 

In the following subsections we will look in detail at the 
morphological, dust and AGN properties of each of these classes. 
As discussed in Section [2] because our sample is magnitude lim- 
ited, throughout the paper we weight each galaxy contributing to 
a mean or total quantity by the inverse of the maximum volume 
in which it may be observed in the survey. This correction is in- 



model fits to these spectra are unlikely to be robust, we do not discuss this 
class further here. 



eluded in all our figures where appropriate. In the following sub- 
sections, 2-dimensional histograms weighted by physical proper- 
ties were cr eated using an adap t ed ve rsion of the adaptive mesh 
algorithm of Cappellar i & Copirj d2003l) . in order to avoid bins con- 
taining a small number of objects biasing our perception of overall 
trends. Bin boundaries are set such that each bin has roughly con- 
stant signal-to-noise ratio equal to the square root of the number of 
galaxies contributing to that bin. 



7.2 Morphology 

The merger of, or interaction between, two galaxies is known to 
induce starbursts and recent major mergers are often suggested to 
be the cause of galaxies with strong g lobal post-starburst stellar 
populations (e.g. lLiu & Kennicuttll 19951 ). N umerical simulations of 
gas rich mergers al so produce starbursts jPi Matteo et al J [20051 : 
ICattaneo et al]2005l) and indicate that dynamical signatures on the 
merger remnants can remain for as much as several Gyr after the 
original merger dCox et af, 2006). Even minor mergers may cause 
the triggering of nuclear starbursts, as small disruptions to the grav- 
itational potential of a galaxy all ow gas to lose angular mo mentum 
and flow towards the center (e.g. Mihos & Hem quistll 1 994h . In this 
section we investigate trends of global galaxy morphology (from 
SDSS photometry) with the stellar populations in the central galaxy 
bulges. 

To give a visual impression of the type of galaxies that host 
bulges residing in the different regions of the PC1/PC2 diagram, 
Figures lB"Tl to |B5| show montages of 1 arcmin square SDSS postage 
stamp images of samples of objects from each main region: the 
quiescent, green-valley (younger mean stellar age than the quies- 
cent), star-forming sequence, starbursts (spectra dominated by O 
and B stars), post-starbursts (stronger Balmer absorption lines). In 
the top left of each montage, the size of an SDSS fibre is indicated: 
our spectral analysis relates only to the central population of stars. 
Only objects with z < 0.05 have been selected for these montages, 
where details such as disturbed morpology and disks are more read- 
ily visible. 

A few obvious trends are worth noting. Figure IB 1 1 Bulges 
with quiescent stellar populations are in general hosted by elliptical 
and SO galaxies. Figure lB2l Galaxies with bulges which lie in the 
"green-valley" show a greater preponderance of outer disk struc- 
tures and irregular morphologies than those with quiescent popula- 
tions. Figure lB3l A few distrubed morphologies are also apparent 
in hosts with star-forming bulges and the galaxies are more likely 
to have disks, particularly as their specific SFR increases (PCI de- 
creases). Figure lB4l About half of the galaxies with bulge stellar 
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Figure 10. Composite spectra of galaxy-bulges within each PCA-defined "class". The inset shows the composite in the wavelength region used during our 
analysis of the recent SFH of the galaxies, normalised to the same flux scale for each class. 



populations that are dominated by very young O and B stars (strong 
starbursts) are globally morphologically distur bed, however, it is 
clearly not a necessary requirement and few iToomre & Toomrel 
dl972[) mergers are found in our high stellar surface mass density 
sample. Finally, in Figure |B51 bulges with excess Balmer absorp- 
tion lines reside in on average slightly more compact systems, how- 



ever evidence of recent dynamical disturbance, such as dust lanes, 
tidal tails or close companions are visible in more than half. We 
will return to the last Figure in the series in Section HOl 

A quantitative analysis of the global structure of the galax- 
ies is possible via the "concentration index" which distinguishes 
ellipticals from spirals based on their radially averaged light dis- 
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Figure 11. Galaxy morphology as a function of stellar population of the bulge. Each 2D histogram bin has been weighted by the mean volume-weighted 
concentration index (left) or mean volume-weighted lopsidedness (right) of the galaxies within that bin. Concentration index is defined to be the ratio of 
Petrosian radii R90/R50 in the r-band; elliptical galaxies in general have C>2.6. Lopsidedness quantifies any large scale overabundance of light on one 
side and corresponding underabundance on the opposite side of the galaxy and is measured in the i-band iReichard et al. 2007). It thus provides information 
additional to the radially averaged concentration index and is sensitive to morphological disturbance. Note that fewer galaxies have measured lopsidedness 
values than concentration indices (see text). A single weak instantaneous burst track of 1% burst fraction is overplotted. 
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Figure 12. Left: The volume-weighted mean z-band dust attenuation of galaxies in each joint PCI/2 histogram bin (A z from Kauff mann et al.l200 3b). Right: 
The volume-weighted mean Bal mer decrement of galaxie s in each bin, defined as /hq/(/h/3) where fn a (fup) is the emission flux of Ha (H/3) corrected 
for underlying stellar absorption iBrinchmann et al. 2004). Note that fewer galaxies have measured Balmer decrements than A z measurements, because the 
former require significant emission line fluxes. A single weak instantaneous burst track of 1% burst fraction is overplotted. 



tributions. We define the concentration index to be the ratio of the 
radii containing 90% and 50% of the Petrosian flux in the r-band 
(C- =R90/R50). This par ameter has been sho wn to be correlated 
with galaxy morphology dStrateva et alj|200lh : elliptical galaxies 
have values around 3 and disk-dominated galaxies have values 
around 2 to 2.5. The classes are often divided at 2.6. In FigurefTTb 
we weight each joint PC1/PC2 histogram bin by the mean con- 
centration of galaxies contained within that bin. There are two no- 
ticeable trends. Firstly, as suggested by the image montages, the 
galaxies hosting bulges with progressively larger specific star for- 
mation rates (more negative PCI) have lower mean CY i.e. more 
exponential profiles. Secondly, the post-starburst bulges exist in, 
on average, more centrally concentrated (higher C r ) galaxies than 
galaxies with similar mean stellar ages. Finally, it is noticeable that 
the stongest starburst bulges to the far right of the figure are found 
to have slightly higher concentrations than the galaxies with more 
ordinary recent star formation histories and less negative PC 1 . This 



is evidence in favour of our suggested toy model, i.e. that the star- 
burst class is formed by otherwise quiescent bulges experiencing 
bursts of star formation, rather than being a more extreme class of 
the ordinary star forming branch of galaxies. 

A statistical measure of galaxy 'lopsidedness' dReichard et al.l 

120071) . allows us to investigate the second order distribution of light 
in a galaxy, beyond the radially averaged concentration index. A 
galaxy's lopsidedness is quantified through a fourier decomposition 
of its light, such that the first component quantifies any large scale 
overabundance of light on one side and corresponding underabun- 
dance on the opposite side of the galaxy. Lopsidedness has only 
been reliably measured for galaxies with z < 0.06 or Petrosian 
r-band magnitude < 16.8; very inclined galaxies, with b/a< 0.4, 
are also excluded. Our sample contains 16722 galaxies. The lopsid- 
edness values used in this paper are derived from the i-band light, 
which best represents the older stellar population and therefore true 
perturbations in the gravitational potential of the galaxy rather than 
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uneven ongoing star formation. In Figure [TTb we weight each joint 
PC1/PC2 histogram bin by the mean i-band lopsidedness of galax- 
ies contained within that bin. We see the same trend of increas- 
ing lopsidedness with decreasing mean stellar age as observed in 
D„(4000)/H<5a by Reichard et al. We also see that the most lop- 
sided galaxies are those that lie along the left edge of our distri- 
bution i.e. galaxies hosting bulges with the strongest ongoing star 
formation through into the post-starburst bulge class. Excess lop- 
sidedness exists right into the post-starburst bulge class, where their 
concentration indexes have already shown that they are becoming 
more elliptical. However, it is clear that these are a mixed popula- 
tion in terms of lopsidedness and investigation of objects on a more 
individual basis is warranted. 

The image montages alone present a strong case for a phys- 
ical link between the recent SFH of the bulge and global galaxy 
structure, although any relationship is clearly far from one-to-one. 
The trends we find of galaxy concentration and lopsidedness with 
recent SFH of the bulge strongly suggest that at least some of the 
central starbursts and post-starbursts are caused by mergers, with 
galaxies becoming more centrally concentrated after the merger, 
but with excess lopsidedness existing up to a Gyr after the starburst 
has occurred in the bulge. 

7.3 Dust 

The presence of dust (cold gas) is intimately connected with star 
formation. In mergers large quantities of gas can be concentrated 
near the nucleus and the resulting high dust column densities lead to 
very large attenuations: extreme examples are the Ultra Luminous 
Infra Red Galaxies (ULIRGS). Dust is also produced in significant 
quantities by Asymptotic Giant Branch (AGB) stars, which first ap- 
pear around 10 8 years after a starburst. This is similar to the prob- 
able ages for our post-starbursts. It has been known for many years 
that dust is also present in many galaxies that are morphologically 
classified as early- type, in contradiction to the standard view of the 
Hubble sequence dSadler & Gerhardlll985l). The presence of dust 
and an AGN may also be closely linked (Kauffmann et alj|2006l : 
lLopes et"al]|2007l . and references therein). 

In Figure 112b each histogram bin is weighted by mean dust 
content of galaxies in that bin, as measured by the attenuation 
in the z- band (A z ) derived from t he colour of the stellar con- 
tinuum by Kau ffmann et al.l d2003bh . In Figure 112b the bins are 
weighted by the observed Ha to H/3 emission line flux ratio 
(Balm er decrement), after cor recting for underlying stellar absorp- 
tion {Brinchmann et al. 2004). Only those galaxies with Ha and 
H/3 flux measured at greater than 3a confidence and flux in the 
H/3 line is greater than 4 x 10~ 16 erg s" 1 cm" 2 are included in the 
latter figure. To first order A z can be thought of as attenuation of 
light from the older stellar population, and the Balmer decrement 
measures the dust attenuation suffered by light from the younger 
stellar population. In general they are well correlated in the SDSS 
galaxies, albeit with significant scatter, suggesting that star forming 
regions and older stellar populations are on average well mixed. 

The expected overall trend of increasing dust content with in- 
creasing specific SFR is seen in both panels of the figure. How- 
ever, two very interesting trends are seen along the strong starburst 
track, particularly in panel b. Firstly, the starburst galaxies are less 
dusty on average than galaxies at slightly lower PCI (lower spe- 
cific SFR). Secondly, a sharp increase in dust content is observed in 
the strong Balmer absorption line objects. Qualitatively, this trend 
fits with our toy model, in which the objects along the strong star- 
burst track have experienced a recent sharp increase in their star 



formation rates and evolve into the post-starburst objects. Our plots 
suggest that this occurs as the first generation of stars from the star- 
burst begin to produce dust. While the spread in observed Balmer 
decrements in this class is large, the post-starburst bulges appear to 
have on average considerably more dust surrounding the line emit- 
ting sources, compared to diffuse interstellar dust, than seen in any 
other class of galaxy bulges. We will return to this point in more 
detail in the discussion. 



7.4 The stellar populations of AGN host bulges 

One of the key aims of this project was to quantify the very recent 
star formation history of bulge stellar populations without relying 
on nebular emission lines which, in a large fraction of our sam- 
ple, are contaminated by AGN emission. In this subsection we ex- 
ploit our success by examining the stellar populations in the bulges 
hosting the AGN. We use the attenuation-corrected [O III] lumi- 
nosity as an indicator of AGN luminosity because [O III] is gener- 
ally the strongest optical emission line of Type 2 AGN, and suffers 
significantly less than the other stron g lines from contami nation 
by emission excited by star formation iHeckman et al.l J2004l) find 
that >90% of [O III] lu minosity arises from t he AGN when objects 
lie above the theoretical Kewle v et al. I d200lh demarcation line on 
the BPT diagram, although this can drop to as low as 50% as ob- 
je cts approach the empirica lly determined, less stringent division 
of iKauffmann et al.l J20033) . 

We first ask the question: where do the bulges that host AGN 
live in PC 1/2 space? In Figure[T3lwe show the joint number count 
distributions in PC 1 and PC2 for our dataset split into four samples 
according to their narrow emission line ratios: pure-AGN, AGN in- 
cluding composite objects, pure-star-forming and unclassified (see 
Section |2~T| >. As with all figures and quoted numbers, galaxies are 
weighted by 1/V m ax so that these figures reflect the true comoving 
number density of each class. Galaxies which lie in the lowest den- 
sity regions of the plots are included as individual points to indicate 
the distribution of outliers. As usual, to guide the eye in comparison 
between different panels, a single, weak instantaneous burst track 
is overplotted with a burst fraction of 1% (see caption to Figure|7](. 

Figure |13b shows that the host galaxies of "pure" AGN 
have predominantly quiescent stellar populations, although some 
are found in star-forming galaxies and a large fraction of post- 
starbursts fall into this category. If the composites are included, 
however, AGN span a much wider range in star formation his- 
tory (Figure [T3f>). AGN are less likely to be found in bulges with 
very young mean stellar ages (Figure [T3b); it is likely, however, 
that some strongly star-forming galaxies do contain AGN, but the 
emission lines from the narrow-line region are weaker than those 
produced by the H II regions in the galaxy, and are thus not de- 
tectable. As expected, the unclassified objects (Figure [T3b) lie pre- 
dominantly in the quiescent cloud, i.e. they have little ongoing star 
formation. Some unclassified objects are also found extending into 
the star-forming and post-starburst region; as we will show in sec- 
tion [7A2] particularly in the case of the post-starburst bulges this 
is caused by strong dust obscuration of the shorter-wavelenth emis- 
sion lines. 



7.4. 1 Contribution to AGN luminosity and black hole growth rate 

The AGN in our sample span a broad range in power (broad range 
in [O III] luminosity), so Figure[T3]paints a potentially misleading 
picture of the link between the star formation history of bulges and 
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Figure 13. Joint number distribution (volume-weighted) of PCI and PC2 for SDSS galaxies classified as pure AGN, AGN (including composite objects), pure 
star-forming, or unclassified. In regions of low number density, individual points are plotted. Objects are classified according to their narrow emission line flux 
ratios [N II]/Hq and [O III]/H/3. Unclassified objects are those without all emission lines above the requisite SNR. As can be seen, this is in general because 
their stellar populations are old. A single weak instantaneous burst track of 1 % burst fraction is overplotted. 



the growth of black holes. Thus, in Figure [14] we use the [O III] 
luminosity of AGN to illustrate the distribution of the growth of 
black holes as a function of the star formation history of their host 
bulge. In this figure, we include all AGN and composite objects, but 
exclude objects with lines dominated by star formation and objects 
for which an accurate (> 3a confidence) Balmer decrement, or 
[O III] line flux, could not be determined. The double power-law 
dust correction (Equation[3]l is applied in these figures. 

We now ask the question: given the broad range in star for- 
mation rate and star formation history, in which bulges is the ma- 
jority of current black hole growth occuring? The result is shown 
in Figure [l4h. where each histogram bin in the plot represents the 
volume-integrated dust-corrected [O III] luminosity of the bulges 
contained within each 0.1 x 0.1 PCI xPC2 pixel, smoothed to the 
same binning as in the other panels of the figure. Despite the large 
range in the number density of bulges per pixel across this diagram 
(see Figure[T3T>), the total AGN luminosity is distributed fairly uni- 
formly across bulges with all types of star-forming stellar popula- 
tion. A decrease is seen in the quiescent bulges, despite their large 
numbers, and also in the strongly starbursting objects. We will re- 
turn to a more quantitative analysis of the total and mean [O III] 
luminosities of each population shortly. 

Next, we ask the question: what type of star formation history 
is associated with the highest mean rate of black hole growth? Fig- 
ure!14[) indicates the mean AGN [O III] luminosity divided by black 



hole mass in each bin (using at as a proxy for black hole mass and 
applying a volume weighting as usual). Thus the colourscale coding 
represents the mean growth rate of black holes (in Eddington units). 
It is clear that on average AGN grow most strongly, i. e. have high 
accretion rates, in host bulges which follow the strong starburst tra- 
jectory, from starburst to post-starburst. As we saw in the last sec- 
tion, objects lieing in this region of PC space are also more likely 
to show evidence of morphological disturbance, certainly strongly 
suggestive of a connection between black hole growth, the growth 
of the stellar bulge and perturbation of the gravitational potential 
by external forces. 

While the bulges lying along the starburst/post-starburst tra- 
jectory have the highest mean black hole growth rates, they are 
relatively few in number. Overplotted in Figure [T4b as red points 
are the 207 bulges with the highest ratios of [O III] luminosity to 
black hole mass. Using the more conservative double power-law 
dust attenuation correction, these systems taken together contribute 
half of the total volume-integrated black hole accretion rate of our 
entire sample of over 33,000 bulges! It can be seen that these are in 
fact distributed fairly evenly across the whole population of young 
bulges, with 75% lying in the star-forming class, 8% in the star- 
bursts and 9% in the post-starbursts. In numbers they account for 
7% of the bulges star-forming bulges, 15% of the post-starbursts 
and 29% of the starbursts. In summary a strong recent or ongoing 
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Figure 14. Top: The volume-xverghted total dust-attenuation-corrected 
[O III] luminosity of the AGN in each joint 0.1 X 0.1 PCI xPC2 histogram 
pixel, smoothed onto the same adaptive bins as figures b and c. The colour 
scale indicates the logarithm of the total luminosity. All luminosities are 
measured in solar units. Middle: The volume-weighted mean dust-corrected 
[O III] luminosity normalised by <r* in each bin. The black points indicate 
the positions of the galaxy bulges with the highest accretion rates, which 
taken together account for 50% of the total accretion rate of our entire sam- 
ple. Bottom: The distance of each system away from the star formation 
locus in the BPT diagram. LINERs and Seyferts are found at large D, com- 
posite objects at smaller D. A single weak instantaneous burst track of 1% 
burst fraction is overplotted. In both the top and center panels, the double 
power-law dust correction has been used (Equation[5J. 



starburst within a bulge is a helpful, but not necessary, condition to 
feed a strongly accreting AGN. 

Putting the results of Figures [l4ji and b together we can see a 
new aspect of the fact that the growth of black holes in the present 
day Universe is spread broadly over all [O III] luminosities i.e. there 
are many weak [O III] systems and fewer strong [O III] systems, 
but all contribute significantly to the overall growth of black holes 
today dHeckman et al.ll 2004: Hao et al. 2005). We add to this result 
by showing that the growth of black holes today is spread widely 
over bulges with all types of recent star formation history, with the 
few strong [O III] systems existing in the rarer strong starburst and 
post-starburst bulges, and the weak [O III] systems arising in the 
more common bulges with weaker star formation. In the following 
subsection we will present these results quantitatively. 

Finally, Figure 114b shows the mean distance of objects 
away from the star-for ming locus on the BPT diagram (D, see 
iKauffmann et alfe oCBa). where composite objects have low D and 
Seyferts and LINERs have larger D; the transition from composite 
objects into pure-AGN dominated objects occurs at around D=0.7. 
Again, strong trends with stellar population of the bulge are evident 
in the sense that the AGN hosted by quiescent and post-starburst 
bulges have emission line ratios which place them further into the 
LINER and Seyfert class, and bulges with even moderately young 
stellar populations host AGN predominantly classified as compos- 
ite objects. For a recent study of the host galaxy properties of 
AGN in the SPSS a s a function of emission line classification see 
iKewlev et aIU 2006). While it is clear that the post-starburst bulges 
are on average more "AGN-like" in their emission lines than galax- 
ies with the same 4000A break strength, it is not currently clear 
whether this can be explained entirely by the reduced contribution 
of ongoing star formation. 



7.4.2 AGN luminosity and the importance of dust correction 

We now look more quantitatively at the total [O III] luminosity 
contributed by AGN in bulges with different recent star formation 
histories, splitting the bulge sample into the stellar population sub- 
classes defined in Section[7]E] 

In the top panels of Figure Q3] the total [O III] luminosity of 
AGN contained in each class of galaxy bulge is plotted, before and 
after correcting the [O III] flux for dust attenuation; the results are 
also given in Table [2] All results are shown for the two dust laws 
given in Section l2. 1.1 l and in the following text results are given first 
for the single power-law dust attenuation correction, and in brack- 
ets for the double power-law correction. We note that although the 
general trends in our results are not changed by which dust law we 
use, the absolute values of the mean [O III] luminosities are con- 
siderably different. Objects without the requisite lines with which 



6 At this point it is necessary to discuss the contamination of the [O III] 
emission by star formation. Can our results be taken as reliable indicators 
of the relative amount of AGN activity occuring in galaxies as a function of 
recent bulge star formation history? Our worst affected class of bulges will 
clearly be the starbursts, however even in this class, the mean [O III] lu- 
minosity in those objects classified as having emission lines predominantly 
originating from star formation is more than a factor of three lower than in 
those objects classified as composite objects or pure AGN. The contribution 
by star formation to the [O III] luminosities of objects classified as AGN can 
thus be expected to be small on average. On the other hand, relatively weak 
AGN in bulges with very high star formation rates may not be recognised 
as such. In this case the contribution to the overall growth of black holes in 
the starburst class may be an underestimate. 
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Figure 15. The total (volume-integrated)(to/>) and mean (bottom) [O III] luminosity of AGN hosted by different classes of bulges, before (left) and after (center 
and right) correction of the [O III] flux for attenuation by dust. The dust corrected results are shown for the two different attenuation laws discussed in Section 
12.11 The classes are labelled as: Qu quiescent; Gr green-valley; SF star-forming; SB starburst; PSB post-starburst. The results are also presented in Tableff] 



Table 2. The fraction (F) of the total volume-integrated, and mean [O III] luminosity of AGN hosted by each class of bulge, before (uncor) and after (cor) 
correction of the [O III] line flux for attenuation by dust. Luminosities are given in solar units. The final column gives the fraction of objects in each class 
for which Balmer decrements are measured, and therefore their [O III] luminosities can be corrected for dust attenuation. All values have been corrected for 
survey volume effects using the 1/V ma x method (see Sectionff). 



class 


F(Luncor) 


log((L unC or» 


F(Lcor ) 


log((L cor » a 


F(Lcor) 


log((L cor » 6 


fraction 


Quiescent 


0.28 


5.98 


0.06 


6.31 


0.12 


6.15 


0.08 


Green Valley 


0.14 


6.05 


0.07 


6.75 


0.10 


6.45 


0.20 


Star forming 


0.47 


6.36 


0.65 


7.47 


0.62 


7.02 


0.61 


Starburst 


0.03 


6.91 


0.05 


8.11 


0.04 


7.65 


0.96 


Post-starburst 


0.03 


6.23 


0.14 


7.81 


0.07 


7.10 


0.35 



a Using a single A 0,7 power-law dust attenuation correction. 
b Using a double power-law dust attenuation correction to account for differing attenuations in the birth clouds and ISM (Equation[5J. 



to calculate the dust attenuation are included in the integral without 
any dust correction; this will cause a small underestimation of the 
total dust corrected [O III] luminosity in most classes as discussed 
in Section [2. 1.1 1 (the fraction of objects with dust corrections are 
given in Table [2}. There are two main results. Firstly we can see 
that all classes of galaxy bulges are important contributers to the 
volume-integrated AGN [O III] luminosity of our sample, although 
clearly the ordinary star-forming class dominates the budget. Sec- 
ondly, due to the trend of increasing dust content with increasing 
star formation our perception of the relative importance of AGN 
hosted by each class of bulge is greatly affected by the inclusion of 
the dust correction. 

In particular the AGN hosted in post-starburst bulges are 
most affected by the dust correction, changing their position in 
the volume-integrated [O III] budget from relatively unimportant to 
contributing at a similar level to the other minority classes, despite 
their small numbers. Careful investigation of the spectral fits shows 
that this is not caused by a systematic problem with our emission 
line measures in objects with strong Balmer absorption, however 



we find that more than 30% (almost 20% for the double power- 
law dust correction) of the final dust corrected [O III] luminosity 
contributed by AGN in post-starburst bulges arises from 13 objects 
with observed Ha to H/3 flux ratio greater than ~8.6, correspond- 
ing to an optical depth in the V-band (tv) of > 5.3 (> 3.3 for 
the double power-law dust correction). At these attenuations small 
errors on the Balmer decrements can lead to significant absolute 
errors on the total luminosities, and the form of the dust law used 
becomes crucial. Additionally, more than 50% of the post-starburst 
bulges are not corrected for dust attenuation before being included 
in the calculation of total [O III] luminosity, primarily due to in- 
sufficient SNR or flux in the H/3 emission line to accurately de- 
termine their Balmer decrements. It is entirely plausible that these 
would also contribute significantly were we able to measure their 
dust contents. Ideally, follow-up IR observation is required to iden- 
tify the true dust contents of these objects, and therefore their true 
contribution to the global volume-integrated [O III] luminosity of 
type 2 AGN (see SectionliHl. 

The bottom panels of Figure [T5l show the mean [O III] lumi- 
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nosity arising from AGN in each class of bulge. Again, the huge 
difference in the post-starburst systems before and after dust cor- 
rection is notable. It is also notable that AGN contained within the 
starburst and post-starburst bulges have the highest mean [O III] 
luminosity, after dust correction, of the entire sample. It is impor- 
tant to note that the mean luminosity of the post-starbursts should 
be taken as lower limits, due to those objects we are unable to cor- 
rect for dust attenuation, and the mean luminosity of the starbursts 
should be taken as upper limits, due to contamination by star for- 
mation. 



8 DISCUSSION 

The distribution of stellar populations of galaxy bulges in the 2- 
dimensional space of 4000A break strength and Balmer absorption 
line strength is qualitatively well described by a burst scenario, with 
a mixture of brief, strong bursts and longer lived or more frequent 
weak bursts causing about half of the bulges to show signs of on- 
going or recent star formation. In the future, a full comparison with 
model galaxy populations will allow us to understand more fully 
the physical processes and timescales involved. 

Here we have concentrated on the morphological, dust and 
AGN properties of galaxies in different regions of 4000A break 
strength and Balmer absorption line strength space. In this section 
we will firstly review and collate the main observational findings 
presented in the preceeding sections, focussing on those galaxy 
bulges with ongoing or recent star formation. We will then sum- 
marise and discuss the main conclusions with respect to the build 
up of black hole mass in the local Universe and the nature of the 
post-starburst bulges. Finally, we will briefly discuss the possibility 
of breaking the well known burst age - burst strength degeneracy. 

8.1 Summary of observational results 

Figure [76l summarises the properties of our sample and our results, 
comparing the distributions of physical properties of galaxies with 
star-forming, starbursting and post-starburst stellar populations in 
their bulges. 

The classical indices D n (4000) and H5a: The post-starburst 
class has a similar D n (4000) distribution to the star-forming class, 
and therefore would be judged to have similar mean stellar ages 
using D n (4000) alone. Our classification of objects as "post- 
starburst" is very different to previous studies, as can be seen from 
the distribution of H5a (only objects with H5a measured with 
SNR>3 are included). Instead of applying a straight cut on Balmer 
line equivalent width, we identify them as having excess Balmer 
absorption for a given D n (4000), allowing us to identify weaker 
and older bursts. 

Ha luminosity and Specific SFR: The sequence in D n (4000) 
is simply a sequence in specific SFR of the galaxy. Those post- 
starburst galaxies with measured Ha emission lines have specific 
star formation rates covering the range of the starforming class. 
However, only ~40% of the total post-starburst sample are shown 
in this figure: a large fraction of the remainder can not be corrected 
for dust attenuation due to very weak H/3 lines. 

Stellar mass and stellar surface mass density: The post- 
starburst and starforming classes are hosted by galaxies with sim- 
ilar stellar mass distributions, with the starburst class having a 
slightly lower mean stellar mass enclosed within the fibre. Our sam- 
ple was defined to have p, > 3 x 10 8 M© kpc -2 , and we can see a 
tail to higher stellar surface mass densities for post-starburst hosts, 



implying smaller z-band radii. This may be an indication of recent 
major mergers in a proportion of this class: simulations of dissipa- 
tional (gas-rich) mergers suggest that remnants are more compact 
than the originating galaxies. 

Concentration and lopsidedness: are both global properties 
measured from SDSS photometry. The starforming and starburst 
bulges live in galaxies with a similar distribution of mean concen- 
tration, although FigureQT]does show a trend of decreasing concen- 
tration with decreasing PCI within the starforming class alone. The 
"lopsidedness" of the light distribution is an important indication of 
recent gravitational disturbance. Although all the distributions are 
broad, both the starbursts and post-starbursts show a higher mean 
lopsidedness in their light distributions. Such effects are expected 
to be identifiable in merger remnants for of-order a gigayear, in line 
with the likely ages of our post-starburst stellar populations. 

Attenuation by dust of stellar continuum light and nebu- 
lar emission lines: It is already well known that both measures of 
dust content in SDSS galaxies are well correlated, although with 
large scatter, and there is a strong correlation between dust con- 
tent and star formation rate. We find strong trends in attenuation 
of continuum light with stellar population, best viewed in the 2D 
histogram of Figure [T2"b. The post-starburst galaxy bulges have a 
much broader distribution of dust contents than the star-forming 
and star-bursting classes, in particular, a large fraction show very 
large Balmer decrements which impacts greatly on their inferred 
star formation rates and AGN accretion rates. 

[O in] emission luminosity: While the majority of type 2 
AGN reside in quiescent galaxies, these are in general the least lu- 
minous, with low accretion rates. Those with high accretion rates 
are hosted by galaxies with younger stellar populations. After cor- 
recting for attenuation by dust, we find that AGN hosted by the 
post-starburst and starburst bulges have the highest mean [O III] 
luminosities and, despite their smaller numbers (3.5% of the en- 
tire AGN sample), contribute significantly to the total volume- 
integrated AGN [O III] luminosity of the local Universe (~10- 
20% depending on the dust correction used). On the other hand, 
the strongest accreting black holes are hosted by bulges with all 
types of recent star formation activity, not just those which lie on 
the evolution tracks of strong starbursts. 

8.2 Black hole accretion and build-up of the stellar bulge 

Clearly, an outstanding question in astrophysics today is the phys- 
ical mechanisms responsible for the M-o relation for black holes 
and bulges. Our main conclusions relating to the recent star forma- 
tion histories of galaxy bulges and the properties of the black holes 
they contain are as follows: 

• Due to the trend of increasing dust content with increasing 
star formation rate, the effect of dust attenuation of the [O III] line 
impacts significantly on our estimation of which galaxies host the 
most rapidly growing black holes. 

• AGN reside in galaxy bulges which have experienced a wide 
variety of recent star formation histories. The quiescent host bulges 
are the most numerous, containing 45% of all AGN. However, these 
are AGN with the lowest mean [O III] luminosities. The bulges ly- 
ing on the strong starburst track (starbursts through post-starbursts, 
burst mass fractions > 1 %) are the least numerous, but have highest 
mean dust corrected AGN [O III] luminosities (Fig|15|l. 

• Putting these results together, and integrating the rate of black 
hole growth over our sample, we find that most (~80%) of this 
growth is occurring in bulges with substantial recent or on-going 
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Figure 16. The distribution of selected spectral parameters, global photometric parameters and derived physical properties for bulges with star-forming (cyan), 
starburst (blue) and post-starburst (brown) stellar populations as classified through our spectral analysis of the stellar continuum, (a) 4000A break strength 
(D n (4000)/mag); (b) H<5 absorption line strength (H<5a/A); (c) dust attenuation corrected Ha luminosity, for those objects with significant Ha and H/3 fluxes 
(/Lq); (d) dust attenuation corrected Ha luminosity, normalised by the stellar mass of the galaxy; (e) stellar mass (M»/Mq); (/) stellar surface mass density 
(/i* /Mq kpc -2 ); (g) r-band concentration (C r ); (h) i-band lopsidedness of global light distribution (Aj); (i) z-band attenuation measured from continuum 
light (A z /mag); (j) the Balmer decrement - flux ratio of Ha to H/3; (k) dust attenuation corrected [O III] luminosity for only those objects classified as AGN 
(pure AGN or Composites) from their narrow emission line ratios (/Lq); (I) [O III] luminosity normalised by stellar velocity dispersion to the power four, a 
proxy for black hole accretion rate relative to the Eddington limit (/Lq km 4 s~ 4 ). All emission line dust attenuation corrections in this plot use the double 
power-law attenuation law in EquationfJ] 



star formation. However, the majority of this growth occurs in 
the star-forming class, which show no evidence for recent major 
changes in their star formation rate. 

• At least half of the total volume-integrated AGN [O III] lumi- 
nosity is contributed by 207 bulges (0.5% of our total bulge sam- 
ple). These are found in bulges that lie in the ordinary star-forming 
region, as well as along the strong starburst track. They account for 
7% of the star-forming bulges, 15% of the post-starburst bulges and 
29% of the starbursts. 

• We therefore conclude that a strong recent or ongoing central 
starburst (possibly fuelled by a tidally-induced inflow of gas) is a 
helpful, but not necessary condition for the build up of black holes 
in the present-day universe. 

The coincidence along the strong starburst track of enhanced 
disturbance of the global morphologies of the galaxies and in- 
creased black hole accretion rates, provides strong circumstantial 



evidence for a merger induced starburst-, or post-starburst-, AGN 
connection. A full analysis will be the subject of the future paper. 



8.3 The nature of bulges with excess Balmer line absorption 

In this paper we have developed a specific methodology for recog- 
nizing galaxies that are undergoing or have undergone an intense 
episode of star formation. Our method differs from most of those 
used in the past. At the close of the paper, we will attempt to put our 
method and its results into the framework of these previous stud- 
ies. Some of these previous studies have emphasised the effect that 
heavy dust attenuation can have on the interpretation of the star for- 
mation history of bursty galaxies. We will examine this issue below. 
Finally, we will briefly comment on the possibility of breaking the 
degeneracy between the time since the starburst and its strength. 
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8.3.1 The post-starburst galaxy zoo 

As with many aspects of astronomy, the different classifications 
of similar objects used by different authors can lead to confusion. 
Here we summarise the different aspects of defining a post-starburst 
sample, with respect to the results presented in this paper. 

All the extant methods for recognizing post-starbursts make 
use of the unique signature provided by the stellar Balmer absorp- 
tion lines. The strength of these lines is not a monotonic function 
of time following a burst of star formation: rather, these lines are 
strongest in A-type stars which have main sequence lifetimes of 
~0.1 to 1.0 gigayears. Traditionally, this "chronometer" has been 
combined with a measure of the strength of the nebular emission- 
lines (typically Ha or [O II]). These are excited by O stars, which 
have a lifetime of only about seven million years. Thus, the tradi- 
tional definition of a post-starburst (strong Balmer absorption-lines 
and weak nebular emission-lines) is quite sensitive to even rela- 
tively young post-starbursts. 

Our classification scheme deliberately selects objects in a dif- 
ferent way. First and foremost, we place no restriction on emission- 
line strengths, as this would r emove all but th e weakest emission- 
line AGN from our sample dYan et all 120061) . This has paid big 
dividends. We have shown that at least 56% of our post-starburst 
galaxies host AGN, and that these are (on-average) the most lu- 
minous AGN in our sample. Preferentially selecting against those 
galaxies with strong AGN will lead to a much reduced sample and 
potentially different conclusions. Selection based on weak emis- 
sion lines also prevents the possibility of selecting objects under- 
going multiple, relativel y closely spaced starbursts. For example, 
Brinchman n et alj d2004h found that low mass galaxies classified as 
post-starbursts based on D4000 vs. H<5a had stronger Ha emission- 
lines (indicating higher current star-formation rates). Thus, whole 
sub-classes of objects undergoing similar processes may be missed 
by a cut based on weak emission-lines. 

It is also important to point out that we do not select strictly 
on the basis of the Balmer absorption-lines. Instead we require that 
post-starbursts have stronger Balmer lines than expected based on 
their 4000A break strength. This latter monotonically and slowly 
increases with time after a burst. This allows us to retain AGN and 
to select much older post-starbursts, up until the point at which 
they return to the green-valley/quiescent population at an age of 
~l-2 Gyr. In fact, Figure [T6b shows that the post-starburst class ex- 
hibits a broad range in H5a- This figure also clearly shows that the 
starburst class has a similar (though narrower) distribution in H<5a 
as the post-starbursts. The difference in our selection method lies in 
the fact that a straight cut on Balmer absorption-line strength pref- 
erentially selects young starbursts, while our method is still quite 
sensitive to older systems. This can be seen clearly in Figure [7] 
where a straight cut on H<5a leads to a very different coverage of 
the burst tracks than our method. 

The disadvantage of our scheme of not selecting on emis- 
sion lines is that it is relatively insensitive to young post-starbursts. 
These lie in the same region of PCI vs. PC2 and D„(4000) vs H<5a 
as young star-forming galaxies. Thus, it is important to recognise 
that some of the bulges that we classify as starbursts may actually 
be young post-starbursts. 



8.3.2 The stellar populations of dusty starbursts 

An intriguing property of our sample of post-starburst galaxies is 
their high average Balmer decrements, indicating very high dust 
contents (see Figure [72l . Fi gure lB6l presents images of some of the 
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Figure 17. Joint number distribution of PCI and PC2 for SDSS galax- 
ies with 0.01 < z < 0.07 and counterparts in the IRAS Faint 
Source Catalogue. In regions of low number density, individual points 
are plotted. To guide the eye, the same single weak instantaneous burst 
track is overplotted as in Figure [131 The filled circles are VLIRGS with 
logL FIH /L > 11.26 (H = 70 km s ~ 1 Mpc ~ 1 ) , t he red points would 
be classified as e(a) galaxies by iPoggianti & Wu 2000). 



dustiest post-starburst galaxies; a comparison with Fi gure lB5l gives 
the impression that they are certainly redder, some with clear dust 
lanes. 

The apparently large amount of dust attenuation makes it dif- 
ficult to obtain a clear and accurate picture of their true nature us- 
ing optical data alone, especially given the large uncertainty in the 
slope of the dust attenuation curve at high optical depths. Strong 
Balmer absorption lines imply an excess of A stars visible rela- 
tive to the slightly younger and hotter O and B stars. It does not 
necessarily imply no O or B stars are present, for example heav- 
ily dust enshrouded starbursts have been suggested as an expla- 
nation for the strong high-order Balmer absorption-li nes seen in 
many galaxies that are lum i nous in the far-infrar ed dSmail et"al] 
1 19991 ; IPoggianti & Wj|2000| ; |Miller & Owerj|200ll) . This explana- 
tion may at first glance appear to fit well with the high dust contents 
of our objects. 

To test this possibility, we turn to a sample of SDSS galaxies 
detected in the far-infrared (FIR) with the IRAS satellite. Note that 
in this sample, no restriction has been placed on stellar surface mass 
density of the galaxies, so these results pertain to the stellar popu- 
lations of the bulges of all FIR bright galaxies with optical counter- 
parts in the SDSS and in the redshift range 0.01 < z < 0.07. 

We use the same matched sample as described in 
IPasauali et al] d2005h . but for SDSS DR4, which includes 3008 
main DR4 galaxies with 0.01 < z < 0.07, SDSS spectral SNR 9 > 
8 and detected 60/im flux. A further 56 galaxies are removed dur- 
ing the PCA, due to poor quality spectra, leaving a final sample 
of 2952 galaxies. Figure [T71 presents the joint PC1/PC2 distribu- 
tion of the galaxies; overplotted is the same weak starburst track 
as in previous figures. According to our classification scheme, we 
find that the central regions of IRAS bright galaxies have predomi- 
nantly starforming and starbursting stellar populations (82%). Only 
5% of them are found in the post-starburst class and only 9% have 
quiescent or green-valley populations. 

We focus in particular on the extreme IR bright objects in 
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our sample, those classified as Ve ry L uminous Infrared Galax- 
ies (VLIRGS) by IWu et all Jl998l) and IPoggianti & Wul bOOOl) . 
Converting their FIR luminosity cut to our cosmology selects 21 
objects with log (Lfir/L r) > 11.26 (Hp = 7 0kms" 1 Mpc~ 1 ), 
where Lfir is defined in Pasq uali et alj d2005l Section 2.2). The 
position of these objects are overplotted as filled circles in Figure 
[IT] and we note immediately that VLIRGS are found in galaxies 
with all varieties of ongoing star formation. 

IPoggianti &Wul fcOOOh find that 56% of VLIRGS show strong 
HS absorption (equivalent width > 4A), but also strong [O II] 
emission (equivalent width > 5A). They classify these galaxies as 
"e(a)", and interpret them as dusty starbursts in which the contri- 
bution of OB stars to the optical spectra is greatly reduced by dust- 
obscuration of the youngest stars. Repeating this analysis for the 
21 VLIRGS in our sample, we find 10/21 e(a) galaxies, in agree- 
ment with their results; these are plotted as red circles in Figure 
|17| We note immediately that these lie in a very different region 
of the diagram to our post-starburst galaxies. With respect to the 
instantaneous burst track, they are considerably younger than our 
post-starburst class; in our classification scheme they would actu- 
ally lie i n the starburst or sta rforming class, fitting nicely with the 
results o f lBalogh et ail (2005) that e(a) galaxies are predominantly 
disk galaxies and may indeed be dusty starbursts. 

8.3.3 The IRAS properties of the post-starbursts 

We have also used the IRAS Faint Source Catalog to characterise 
the FIR luminosities of our sample of post-starburst bulges. While 
we find that the fraction of post-starbursts with 60/im IRAS de- 
tections increases with increasing balmer decrement, the median 
value for the implied FIR luminosity is only 10 10 ' 4 Lq, and only 
10 10,5 L for the ~10% of FIR-detected post-starburst bulges with 
the steepest Balmer decrements. This is significantly lower than the 
median value of 10 11 1 for the VLIRGs. 

We conclude that our dusty pos t-starbursts are signifi cantly 
different from the VLIRGs studied bv |Poggianti & Wu (2000). Our 
objects have an older mean age (less negative values of PCI), a sig- 
nificantly greater excess in the strength of the Balmer absorption- 
lines relative to typical galaxies of this age (larger PC2), and sub- 
stantially lower FIR luminosities (lower implied star formation 
rates). We therefore believe that our objects are bona fide post- 
starbursts. The true cause of the high Balmer decrements awaits 
follow-up infra-red observations, combined with detailed mod- 
elling of the dust distribution in galaxies, including in particular 
AGB stars. 

8.3.4 Strong and old, or weak and young bursts? 

The first step required to really constrain the causal connection be- 
tween the central starbursts and the strong AGN signatures is to 
break the degeneracy between the age of the last burst of star for- 
mation (strictly the time since the starburst switched off) and the 
mass of stars formed in the burst. In Figure [7] we showed how this 
may be achieved using our third index, excess Ca II (H&K) ab- 
sorption. These absorp tion lines have been introduced before by 
iLeonardi & Rose] Jl996h as a promising age diagnostic of bursts, 
however due to the SNR required for their index, it is not applica- 
ble generally to SDSS spectra. Taking Figure|7]at face value would 
suggest that there are a number of bulges with strong Ca II absorp- 
tion which must be caused by very old, strong bursts. Tracing the 
tracks back in PC 1/2 would indicate that the progenitors of such 
systems are not contained within the SDSS sample. 



However, it is premature to draw such a conclusion for sev- 
eral reasons. Firstly, as discussed in Section |6T1 the BC03 stellar 
population models show a systematic offset from the SDSS data in 
4000A break strength. As explained in Section 16.11 the first PCA 
component contains the primary correlation between 4000A break 
strength, Balmer line strength and Ca II absorption line strength 
(see Figure(2] all three spectral features are visible in the first eigen- 
spectrum). Therefore, on fitting the correct 4000A break strength 
the first component reproduces the wrong absorption line strengths, 
forcing the second and third components to compensate. This could 
cause a systematic offset in PC3 and lead to the mistaken conclu- 
sion that the bursts are stronger than in reality. 

Secondly, the instantaneous burst model is almost certainly a 
gross over simplification of the true recent star formation history of 
the bulges. We are now reaching the stage in galaxy spectral analy- 
sis where models with a physically motivated star formation history 
are required. By combining the next generation of accurate spectral 
synthesis models with numerical simulations of galaxy mergers, 
bar instabilities and the interaction of galaxies with the interclus- 
ter medium, and accurate number counts provided by simulations 
within a cosmological framework, the analysis of galaxy spectra 
to determine precise star formation histories will provide strong 
constraints on the physical processes driving the inflow of gas onto 
black holes, and the relative importance of star formation and black 
hole feedback in the evolution of galaxies. 



9 FINAL THOUGHTS 

The huge number of high quality galaxy spectra now available to 
us with surveys such as the SDSS, allow unprecedented statisti- 
cal studies of the properties of galaxies. Until relatively recently, 
both samples and the spectral regions observed were small, mak- 
ing difficult analyses which are now routine, such as measurement 
of the Balmer decrement to correct emission lines for dust attenua- 
tion. With such large quantities of high quality data, some form of 
data compression is necessary in order to cope. However, the sim- 
ple extraction of small regions with which we are familiar - such 
as the Lick indices - may be a poor way to make full use of all 
the information now available to us. Now is the time in which sta- 
tistical techniques applied to spectra, such as the one presented in 
this paper, can provide useful additional constraints on the physical 
processes underlying the galaxy spectra. Of course, as more sen- 
sitive techniques are studied new problems will be found with the 
models, but these are rarely a cause for halting such studies. 

We have developed a new set of indices specifically designed 
for investigating the very recent star formation histories of galax- 
ies, and applied them to the bulges of low redshift galaxies in the 
SDSS. The indices are general, however, and may easily be ap- 
plied to other datasets. In this paper, we have focussed on quali- 
tative trends of global morphology, dust and AGN properties with 
bulge stellar population in the low redshift Universe, showing how 
new trends can be uncovered by simply using more sensitive tech- 
niques for measuring stellar populations from galaxy spectra. In the 
future we will be able to compare results such as these directly to 
theoretical models, using spectra calculated during detailed simu- 
lations of galaxy mergers and semi-analytic simulations of galaxy 
populations within the cosmological framework. Such comparisons 
will allow us to investigate, for example, the relative importance of 
truncation of star formation and short starbursts in galaxies in clus- 
ters, of weak starbursts on the mass build up galaxy bulges, and of 
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the quenching of star formation in bulges due to AGN feedback, 
some of the key questions in galaxy evolution today. 
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APPENDIX A: PRINCIPAL COMPONENT ANALYSIS 

The standard fo r malism of PCA pr e sented here is compiled from 
iKendalll Jl975h . lEfstathiou & Fall J 19841) and iMurtagh & Heckl 
dl987l) . In all that follows vectors are represented by a single un- 
derscore, 2-dimensional matrices by a double underscore. 

Firstly, the "mean spectrum" is subtracted from all input spec- 
tra. This is not a necessary requ irement for PCA, and often not 
applied (e.g. lConnollv et aljl995l) , however it removes the require- 
ment for the first component to point in the direction of the mean 
spectrum which, due to the orthogonality constraint of PCA, affects 
subsequent components. 

The N mean subtracted spectra each M pixels long are placed 
in a data array X_ with elements Xij, where l^i^iV, 1 ^ j ^ 
M. The elements of the covariance matrix (Q) of this data array are 
given by: 

1 - 

C »= N^ XiiXik - (A1) 

The covariance matrix can be decomposed into an eigenbasis, de- 
scribed by a set of eigenvectors ({e}, principal components in the 
language of this paper, also referred to as eigenspectra) each M 
pixels long : 

Qe 3 = \ jSj (A2) 

where Xj are the eigenvalues and j identifies the eigenvector. Note 
that these are orthogonal unit vectors: 

eje k = e H e ki = 5jk (A3) 

i 

where T represents the transpose. It can be shown that e x is the axis 
along which the variance is maximal, e 2 is the axis with the second 
greatest variance, and so on until e M has the least variance. The 
principal 

component amplitudes for each input spectrum / are given by 
<h=f&i- (A4) 
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Reconstruction of the spectrum is achieved by multiplying the APPENDIX B: EXAMPLE SDSS IMAGES 
principal component amplitudes by their respective eigenvectors: 

M 



In general as the variance of the eigenvectors decrease, so does the 
useful information contained in the spectra, hence making PCA a 
useful form of data compression: in equation dA5t we would sum 
from j = 1 to ra where m « M, hence reducing the dimension- 
ality of each spectrum from N to m. The exact number of eigen- 
vectors required to reconstruct the input spectrum is unconstrained, 
and decided upon based on the dataset and purpose of analysis. 
The M or m eigenvectors can be used as a basis set upon which 
to project any spectrum (/) of the same dimensions. The recon- 
structed spectrum only contains information present in the eigen- 
vectors, which may not be a fair representation of the spectrum 
if the spectrum were not used during creation of the eigenvectors 
and/or fewer than M eigenvectors are used during reconstruction. 




(A5) 
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Figure Bl. Quiescent: Example SDSS postage stamp images of galaxies whose bulge stellar populations are old i.e. are part of the red sequence, quiescent 
cloud in PC parameter space. Each image is 1 arcmin square and PCI and PC2 values are given at the bottom. The approximate size of an SDSS spectroscopic 
fibre is indicated in the top left corner. Only galaxies with z < 0.05 have been selected. 

Figure B2. Green-valley:Same as Figure IBTI for galaxies with slightly younger bulge stellar populations (smaller D n (4000) or PCI) than those in the red 
sequence - so-called "green-valley" objects. 

Figure B3. Star-forming:Same as Figure lBTI for galaxies with star forming bulges, with intermediate D n (4000) or PCI. They are ordered by increasing PCI. 

Figure B4. Starburst:Same as Figure IBTI for galaxies with bulges dominated by light from young O and B stars. Both PCI and PC2 are small, indicating a 
very weak 4000A break and weak Balmer absorption lines. 



Figure B5. Post-starburst: Same as Figure lBTI for galaxies with excess Balmer absorption lines over that expected for their 4000A break strength. 



Figure B6. Dusty objects: Same as Figure lBTI for galaxies with excess Balmer absorption lines over that expected for their 4000A break strength and Ha to 
H/3 flux ratios greater than > 8.6. Galaxies at all redshifts are included (i.e. 0.01 < z < 0.07). 
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